Gene Acid345_3807 details

Gene Information

Locus tag: Acid345_3807
Symbol:
ID: 4071091
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 4500115
End bp: 4501596
Gene Length: 1482 bp
Protein Length: 493 aa
Translation table: 11
GC content: 55%
IMG OID: 637985830
Product: undecaprenyl-phosphate galactosephosphotransferase
Protein accession: YP_592881
Protein GI: 94970833
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG2148] Sugar transferases involved in lipopolysaccharide synthesis
TIGRFAM ID: [TIGR03025] exopolysaccharide biosynthesis polyprenyl glycosylphosphotransferase
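
The Start bp, End bp, Gene Length and Protein Length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes the coordinates are 1-based and inclusive and that the gene length includes the terminal stop codon, as is usual for an unspliced CDS.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # drop the stop codon


# Values taken from the record above.
length = gene_length_bp(4500115, 4501596)
print(length, protein_length_aa(length))  # prints: 1482 493
```

Both results agree with the Gene Length (1482 bp) and Protein Length (493 aa) fields of this record.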


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.27586
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.25829
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGACAT CGGCGACTCC AGGACGTTCT TCTCCGCAAA CCAAACCTCC CATACCCCGG 
GACCTCAGTT CTCGGTTGGA AGCACCTCGG GCGAGGCACG GGAACGCGCT AAAAGTTGTA
ATCGAAGTCC TGGAGCGCCT TTTTGATGCG GTCGCTGTGT TCACCGGGAC CGTTCTGGCA
TACAACGTTT ACCGATTGGT TGGACTCGGT CGGAGGGTCG AGTATCCAGT CGGTTGGCTG
ACAACTGCTG CGATTGCGTT TTCGGTCGTG TTCATTTTGT TGCTGGAACA CCTGGGTGAG
TATCGTTCCG GGAGCCTTCT CGGTGTGCGT GAGACCGAAC GAGTCCTGCG CGCTGCTTGG
TACTCTGTTT TTCTCGTTTT CCCAATCGCT TTTTTTTCGG GGCGGTCTTA CTCAAGACTT
GTGGTTTTGT TCGTGGCGGC AGTCGTTCCG ATCTGTGTCC TTATCGTCCG ACAGTTTTCG
TTCCGAGTGA TTGAGCGGCT TTGTATCGCC GGAGATCTCT TGCGACCGGC GGCCATTTAC
GGGAGCGGCA GATCCGCGAG GAAGGTTTAC TCGGCGCTGC TGCGCTCGCC GAAACTCGGG
CTGAAGCCGG TGGCATTCCT CGATGAGAAG GAGACGGATC GGTCAAAGCA TGTGTTCGAG
GCCTCGTATC ACCGAAAGAA CAGTGCGCAA GTGATCGCGA TGCCGATCAC GATCGAGTTG
CTCAAGGACT TGCAGATCAG CAGCCTCGTG CTTTGCGAAT TACCGCCGGC TTCCAAACTT
TCAGTCGTCG AAGAGTGCTG CCGACAGGCT GGCGTAGAGG TTTATCTGTC GCCAGGCCTG
ATGCATTCTG AAGATCAATC CATTGAATTC GTTGACATTG ACGGAATGTT GTTCGCTCGC
ACTGCGGGAG TGAACCATGT ATCGCTATAT GATTTGTTCA AGCGTGTCTT CGATGTGCTG
GCGGCCGGAG TTATCACGCT GGTGATATCG CCGATCCTCC TCGCGATTGC CCTCGCGGTT
AAGTTTTCGT CTCGCGGCAA AATACTATTT GTGCAAGAGC GAGTCGGGCG CGGCGGTACG
CAGTTCGCAA TGTACAAGTT TAGGTCGATG CATGCCGATG CACCGAAATA TGCGTTTTCG
CCACGTGAGG TAGAAGATCC GCGCATCACG CTCATTGGGC GTTTCTTGCG GCGCACAAGC
CTCGATGAAT TGCCTCAACT ATTTAATGTA ATTCGGGGCG AGATGTCGCT GGTGGGGCCA
CGACCAGAAA TGCCGTTCAT CGTGGAGAGG TACACCCCGC AGCAGCGACG AAGATTAGAT
GTGAAGCCGG GAATCACAGG CTTGTGGCAA CTCAGCGCCG ATCGCGCTTA CCTGATTCAC
GAGAACATCG AGTACGATCT TTACTACGTT AGGAACCGAA ACTTTTTCCT GGACCTCGCG
GTTCTAGTCC ATACGGCAGT TTTTGCAGCC CGGGGAGTCT GA
 
Protein sequence
MATSATPGRS SPQTKPPIPR DLSSRLEAPR ARHGNALKVV IEVLERLFDA VAVFTGTVLA 
YNVYRLVGLG RRVEYPVGWL TTAAIAFSVV FILLLEHLGE YRSGSLLGVR ETERVLRAAW
YSVFLVFPIA FFSGRSYSRL VVLFVAAVVP ICVLIVRQFS FRVIERLCIA GDLLRPAAIY
GSGRSARKVY SALLRSPKLG LKPVAFLDEK ETDRSKHVFE ASYHRKNSAQ VIAMPITIEL
LKDLQISSLV LCELPPASKL SVVEECCRQA GVEVYLSPGL MHSEDQSIEF VDIDGMLFAR
TAGVNHVSLY DLFKRVFDVL AAGVITLVIS PILLAIALAV KFSSRGKILF VQERVGRGGT
QFAMYKFRSM HADAPKYAFS PREVEDPRIT LIGRFLRRTS LDELPQLFNV IRGEMSLVGP
RPEMPFIVER YTPQQRRRLD VKPGITGLWQ LSADRAYLIH ENIEYDLYYV RNRNFFLDLA
VLVHTAVFAA RGV
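
The gene and protein sequences above can be checked against each other by translating the DNA. The following is a minimal sketch, not the method used to produce this record: the function names are hypothetical, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ only in permitted start codons). It strips the spaces and line breaks used in the sequence layout above and stops at the first stop codon.

```python
from itertools import product

# Codon table built in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace used to group bases into blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]  # raises KeyError on non-ACGT symbols
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence above.
first_line = "ATGGCGACAT CGGCGACTCC AGGACGTTCT TCTCCGCAAA CCAAACCTCC CATACCCCGG"
print(translate(first_line))  # prints: MATSATPGRSSPQTKPPIPR
```

Passing the complete gene sequence to translate() should reproduce the protein sequence listed above, and gc_percent() on the same input can be compared with the GC content field of the record.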