Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_2904 |
Symbol | |
ID | 4071205 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 3447333 |
End bp | 3448973 |
Gene Length | 1641 bp |
Protein Length | 546 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 637984922 |
Product | glycosyl transferase family protein |
Protein accession | YP_591979 |
Protein GI | 94969931 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCGTCCCC TGAGCACCGA AATCCTGGCA GCTTTCTCTG TCTCACAGGC ACACGGAGCG CAACATTGGA TCCGCACGCA CGTCCTCGAC ACCACGTTTA AAGGCCTTTA CCAAGCCAAC GCCTTCGACC TCTGCCTTCT CATTCCGTAC TTCATCGTCC TCATTATTCT TGCCGCGTAT GGCGTGCATC GGTACCAGCT CGTCTGGATG TACTACCGCA ATCGCAAGAA TAAGACGACT GACCCGCCGC AGCACTTCGC CGAGTTGCCG CGCGTCACCG TGCAGTTGCC GATCTTCAAC GAACAGTACG TCATTGACCG CCTCGTAGAA GCCGTTTGCA AGCTCGACTA CCCGAAGGAC AAGCTCGACA TCCAGGTCCT CGACGACTCC ACCGACGAGA CCGTCGAAGT TGCGCGCGAG GTGGTGGAGC GCTATGCCGC GCTTGGCAAC CCGATCTCTT ATATTCATCG GACGAACCGC CACGGCTTCA AGGCGGGCGC ACTTCAGGAA GGTATGGCCG TCTGCAAGGG CGAGTTCATC GCCATCTTCG ACGCCGACTT CGTGCCGCCC GCAGACTTTC TACAGAAGTG CATTCACCAC TTCGCCGAGC CGGAAATCGG TATGGTGCAA ACGCGCTGGA CGCACCTGAA CCGCAACTAC TCGTTCCTCA CCGAAGTTGA GGCCATCCTC CTTGACGGCC ACTTCGTGCT TGAGCACGGC GGCCGCTCCC GCAAGGGCGT CTTTTTCAAC TTCAACGGCA CCGCCGGCAT GTGGCGCAAG CAGGCCATTG AAGAAGCTGG TGGCTGGCAG CACGACACCC TGACCGAAGA CACCGATCTC AGCTATCGCG CGCAGGTAAA GGGTTGGCGG TTCAAGTATC TGCAGGATGT CGAGTGCCCC GCGGAATTGC CGATCGAAAT GACGGCCTTC AAAACCCAGC AGGCGCGTTG GGCGAAGGGG CTTATCCAGT GCTCGAAAAA AGTGTTGCCG TTCTTGTACC GCAGCGACGT GCCGCGGCGC GTAAAAGTCG AAGCCTGGTA TCACCTCACC GCCAACATTA GTTATCCGCT GATGATCGTT CTATCGGCCC TCATGCTTCC GGCGATGGTG CTGCGCTTCT ACCAGGGCTG GTTTCAAATG CTCTACATTG ATATGCCGCT GTTCCTGGCA TCCACGTTCA GCATCTCGAG CTTCTATCTA GTCTCGCAAA AAGAACTCTA TCCAAAGACG TGGCTGCGGA CATTCATGTA TCTGCCCGCA CTCATGGCGC TCGGGATCGG CCTGACGGTG ACGAATACAA AGGCCGTGCT GGAAGCCATC GTCGGCAAGC AGTCGGCCTT CGCACGTACG CCTAAATATC GCGTCACCAA CAAGGGCGAG AAATCCATCG CCGCAAAGAA GTATCGCAAG CGCCTCGGCA TCATTCCCTG GATCGAACTG GCGATCGGCA CGTGGTTCGC CGCGTGCGTG TGGTACGCCG TCAGCCGCGA GAACTACATT ACAGTTCCCT TCCTCTGCTT GTTTGTCTTC GGTTACTGGT ACACAGGACT GATGTCACTC CTGCAAGGCC GCTTCGATTC GCTCATGGGC CGCACCGCCA GCCCGGAAAC CCACACCAAG CCCTTCCCCG TCGGCGTGTA G
|
Protein sequence | MRPLSTEILA AFSVSQAHGA QHWIRTHVLD TTFKGLYQAN AFDLCLLIPY FIVLIILAAY GVHRYQLVWM YYRNRKNKTT DPPQHFAELP RVTVQLPIFN EQYVIDRLVE AVCKLDYPKD KLDIQVLDDS TDETVEVARE VVERYAALGN PISYIHRTNR HGFKAGALQE GMAVCKGEFI AIFDADFVPP ADFLQKCIHH FAEPEIGMVQ TRWTHLNRNY SFLTEVEAIL LDGHFVLEHG GRSRKGVFFN FNGTAGMWRK QAIEEAGGWQ HDTLTEDTDL SYRAQVKGWR FKYLQDVECP AELPIEMTAF KTQQARWAKG LIQCSKKVLP FLYRSDVPRR VKVEAWYHLT ANISYPLMIV LSALMLPAMV LRFYQGWFQM LYIDMPLFLA STFSISSFYL VSQKELYPKT WLRTFMYLPA LMALGIGLTV TNTKAVLEAI VGKQSAFART PKYRVTNKGE KSIAAKKYRK RLGIIPWIEL AIGTWFAACV WYAVSRENYI TVPFLCLFVF GYWYTGLMSL LQGRFDSLMG RTASPETHTK PFPVGV
|
| |