Gene Acid345_2904 details


Gene Information

Locus tag: Acid345_2904
Symbol:
ID: 4071205
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 3447333
End bp: 3448973
Gene Length: 1641 bp
Protein Length: 546 aa
Translation table: 11
GC content: 58%
IMG OID: 637984922
Product: glycosyl transferase family protein
Protein accession: YP_591979
Protein GI: 94969931
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis
TIGRFAM ID:
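
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the reported gene length includes the stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 3447333
end_bp = 3448973

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# A CDS should be a whole number of codons.
codon_count, remainder = divmod(gene_length, 3)

# Assumption: the final codon is a stop codon and is not translated.
protein_length = codon_count - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the computed values agree with the reported Gene Length (1641 bp) and Protein Length (546 aa).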


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCGTCCCC TGAGCACCGA AATCCTGGCA GCTTTCTCTG TCTCACAGGC ACACGGAGCG 
CAACATTGGA TCCGCACGCA CGTCCTCGAC ACCACGTTTA AAGGCCTTTA CCAAGCCAAC
GCCTTCGACC TCTGCCTTCT CATTCCGTAC TTCATCGTCC TCATTATTCT TGCCGCGTAT
GGCGTGCATC GGTACCAGCT CGTCTGGATG TACTACCGCA ATCGCAAGAA TAAGACGACT
GACCCGCCGC AGCACTTCGC CGAGTTGCCG CGCGTCACCG TGCAGTTGCC GATCTTCAAC
GAACAGTACG TCATTGACCG CCTCGTAGAA GCCGTTTGCA AGCTCGACTA CCCGAAGGAC
AAGCTCGACA TCCAGGTCCT CGACGACTCC ACCGACGAGA CCGTCGAAGT TGCGCGCGAG
GTGGTGGAGC GCTATGCCGC GCTTGGCAAC CCGATCTCTT ATATTCATCG GACGAACCGC
CACGGCTTCA AGGCGGGCGC ACTTCAGGAA GGTATGGCCG TCTGCAAGGG CGAGTTCATC
GCCATCTTCG ACGCCGACTT CGTGCCGCCC GCAGACTTTC TACAGAAGTG CATTCACCAC
TTCGCCGAGC CGGAAATCGG TATGGTGCAA ACGCGCTGGA CGCACCTGAA CCGCAACTAC
TCGTTCCTCA CCGAAGTTGA GGCCATCCTC CTTGACGGCC ACTTCGTGCT TGAGCACGGC
GGCCGCTCCC GCAAGGGCGT CTTTTTCAAC TTCAACGGCA CCGCCGGCAT GTGGCGCAAG
CAGGCCATTG AAGAAGCTGG TGGCTGGCAG CACGACACCC TGACCGAAGA CACCGATCTC
AGCTATCGCG CGCAGGTAAA GGGTTGGCGG TTCAAGTATC TGCAGGATGT CGAGTGCCCC
GCGGAATTGC CGATCGAAAT GACGGCCTTC AAAACCCAGC AGGCGCGTTG GGCGAAGGGG
CTTATCCAGT GCTCGAAAAA AGTGTTGCCG TTCTTGTACC GCAGCGACGT GCCGCGGCGC
GTAAAAGTCG AAGCCTGGTA TCACCTCACC GCCAACATTA GTTATCCGCT GATGATCGTT
CTATCGGCCC TCATGCTTCC GGCGATGGTG CTGCGCTTCT ACCAGGGCTG GTTTCAAATG
CTCTACATTG ATATGCCGCT GTTCCTGGCA TCCACGTTCA GCATCTCGAG CTTCTATCTA
GTCTCGCAAA AAGAACTCTA TCCAAAGACG TGGCTGCGGA CATTCATGTA TCTGCCCGCA
CTCATGGCGC TCGGGATCGG CCTGACGGTG ACGAATACAA AGGCCGTGCT GGAAGCCATC
GTCGGCAAGC AGTCGGCCTT CGCACGTACG CCTAAATATC GCGTCACCAA CAAGGGCGAG
AAATCCATCG CCGCAAAGAA GTATCGCAAG CGCCTCGGCA TCATTCCCTG GATCGAACTG
GCGATCGGCA CGTGGTTCGC CGCGTGCGTG TGGTACGCCG TCAGCCGCGA GAACTACATT
ACAGTTCCCT TCCTCTGCTT GTTTGTCTTC GGTTACTGGT ACACAGGACT GATGTCACTC
CTGCAAGGCC GCTTCGATTC GCTCATGGGC CGCACCGCCA GCCCGGAAAC CCACACCAAG
CCCTTCCCCG TCGGCGTGTA G
 
Protein sequence
MRPLSTEILA AFSVSQAHGA QHWIRTHVLD TTFKGLYQAN AFDLCLLIPY FIVLIILAAY 
GVHRYQLVWM YYRNRKNKTT DPPQHFAELP RVTVQLPIFN EQYVIDRLVE AVCKLDYPKD
KLDIQVLDDS TDETVEVARE VVERYAALGN PISYIHRTNR HGFKAGALQE GMAVCKGEFI
AIFDADFVPP ADFLQKCIHH FAEPEIGMVQ TRWTHLNRNY SFLTEVEAIL LDGHFVLEHG
GRSRKGVFFN FNGTAGMWRK QAIEEAGGWQ HDTLTEDTDL SYRAQVKGWR FKYLQDVECP
AELPIEMTAF KTQQARWAKG LIQCSKKVLP FLYRSDVPRR VKVEAWYHLT ANISYPLMIV
LSALMLPAMV LRFYQGWFQM LYIDMPLFLA STFSISSFYL VSQKELYPKT WLRTFMYLPA
LMALGIGLTV TNTKAVLEAI VGKQSAFART PKYRVTNKGE KSIAAKKYRK RLGIIPWIEL
AIGTWFAACV WYAVSRENYI TVPFLCLFVF GYWYTGLMSL LQGRFDSLMG RTASPETHTK
PFPVGV
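
The gene sequence begins with GTG, yet the protein begins with M. This is consistent with translation table 11, in which GTG can act as an initiation codon and is then read as methionine. The sketch below illustrates this on the first ten codons and the last seven codons of the gene sequence. The function name `translate` and the set `START_CODONS` are hypothetical helpers written for this example, and the start-codon set is a deliberately small subset of the initiators allowed by table 11.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G). Table 11 uses the
# same amino acid assignments as the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): AMINO[i]
    for i, codon in enumerate(product(BASES, repeat=3))
}

# Assumption: a subset of table 11 initiation codons, enough for this gene.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = dna.replace(" ", "").upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        codon = dna[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            # An initiation codon is read as methionine.
            protein.append("M")
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First ten codons of the gene sequence.
head = translate("GTGCGTCCCC TGAGCACCGA AATCCTGGCA")
# Last seven codons of the gene sequence, including the TAG stop.
tail = translate("CCCTTCCCCG TCGGCGTGTA G")
print(head, tail)
```

The two fragments translate to the first ten residues (MRPLSTEILA) and the last six residues (PFPVGV) of the protein sequence shown above.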