Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_1583 |
Symbol | |
ID | 4069021 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 1931415 |
End bp | 1932992 |
Gene Length | 1578 bp |
Protein Length | 525 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637983592 |
Product | hypothetical protein |
Protein accession | YP_590659 |
Protein GI | 94968611 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.067578 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTAGCC CTCCCTCCAA TCTCGACCAG CCATTGCAGT CTCCGCCTTC AGTCCGGGAA CACCTGTCGG CGTTCTCGCT GATCTTCGTC GCGCTCTTCC TCAGCCACGG CGCACTCTTA CGGCTTCCCT ATTTCTGGGA TGAGGCTGGG TATTACATTC CTGCAGCACG CGACCTTCTA ATTACCGGCG ATTTGATTCC GCACTCGACG CTTTCGAACG CGCATCCGCC ACTGCTGGCG ATTTACCTCG CAACCTGGTG GAAGCTGAGC GGCTTCACGC CCGCAGTGAC GCGAATCGCC TTACTCATCG TCACCTCGTT CTCGCTCCTT GGGCTCTGGC GACTCGCGTC TGTAGTGGCG AACCGTACGG TTGCGGTGAT CGCCGTCCTC TGCACTGCCG CGTTTCCGGT TTTCTTCGCG CAGAGTTCTC TCGCCCACCT CGACATGCTG GTGACCGCAT TCACCATCTG GGGAGTGGCC TTCTACTTGG AAGGCCGCAA TGTTCGTGCG GTAATCATGT TCGCCTTCGC GGGACTTTCA AAAGAAACGG CGATTCTTAC ACCCGGCGTT CTCATCGCAT GGGAGGTATT CGCTCCGGCA AGATGGCGCA CCGGTCCGCG AAACTGGAAG CGCGCGGTGC GGCTGCTGCT GAGCTTCGTT CCGCTGGCGA TTTGGTTTCT GTATCTGCAC CATCGCACCG GCTTTTACAC TGGCGATCCG TACTACTACA GCTATAACGT CGGCGCGACG CTCACTCCAC TCCGCATCGT CCTCGCCATC GTGCTCCGTT TGTGGCACAC GCTCGGCTAT ATGAACCTGT TCGTGCTCAC GCTGCTCACG TTGGCTGCAA TGCTCGAACC CGCCGTGGTG GATGGCACGG CCGCACGACG CCGTATCGCA CTCCCCGTGC AGATGGTTCT GTACGCGGTA ATCGCCGCAC ACGTGGTGTC GCTTGCGATT GCAGGCGGCG CGGTGCTCGC ACGCTACATG TTGCCGGTCT ATCCGCTGAT TGTGCTGGTC TGCGTGAGCA CGCTCTATCG ACGCTTGCGC TGGTGGCCGG TCGCAACCGC GGTCGTCGTT GCGTCGTTTC TGGCAGGCCT GGTGTTCAAT CCGCCGTACC GCTTCGCGCC GGAAGACAAT CTCACCTATC GCGACTACAT CCTCCTGCAT CGTGGCGCTG CAACGTATCT CTCGCAGCAC GCACACGGCG CACATGTCCT GACGGCATGG CCTGCCTCTG ACGAAATCTC ACGGCCGTTT CTTGGATACC TGAAAGAGCC CATCCCTGTC GTCCGCATCG AGAACTTCAC CGCGGCGCAG ATGACGCTCG CCGCCGCTGC GCAAGGCCAG TACGACTGGG TCTATCTCTT CTCAACGAAA TATGAGCCTC CACACCTGCT TATTCATTCA GCCTACTGGG AGGGCATGCA GAAGCGGTTC TTCGATTACC ACATCGACAT CTCGCCCGAA GTGGCAGCGC GCATGGTCGG CGGACGCATT GTGTACCAGT CCCATCGCCG CGGGGAATGG GTGGCTCTGG TGCAGATCGA GCACGCGGAG AACGCGCGGC TTCGCTAG
|
Protein sequence | MSSPPSNLDQ PLQSPPSVRE HLSAFSLIFV ALFLSHGALL RLPYFWDEAG YYIPAARDLL ITGDLIPHST LSNAHPPLLA IYLATWWKLS GFTPAVTRIA LLIVTSFSLL GLWRLASVVA NRTVAVIAVL CTAAFPVFFA QSSLAHLDML VTAFTIWGVA FYLEGRNVRA VIMFAFAGLS KETAILTPGV LIAWEVFAPA RWRTGPRNWK RAVRLLLSFV PLAIWFLYLH HRTGFYTGDP YYYSYNVGAT LTPLRIVLAI VLRLWHTLGY MNLFVLTLLT LAAMLEPAVV DGTAARRRIA LPVQMVLYAV IAAHVVSLAI AGGAVLARYM LPVYPLIVLV CVSTLYRRLR WWPVATAVVV ASFLAGLVFN PPYRFAPEDN LTYRDYILLH RGAATYLSQH AHGAHVLTAW PASDEISRPF LGYLKEPIPV VRIENFTAAQ MTLAAAAQGQ YDWVYLFSTK YEPPHLLIHS AYWEGMQKRF FDYHIDISPE VAARMVGGRI VYQSHRRGEW VALVQIEHAE NARLR
|
| |