Gene Acid345_2833 details

Gene Information

Locus tag: Acid345_2833
Symbol:
ID: 4071836
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 3370372
End bp: 3371718
Gene Length: 1347 bp
Protein Length: 448 aa
Translation table: 11
GC content: 58%
IMG OID: 637984851
Product: UDP-glucose/GDP-mannose dehydrogenase
Protein accession: YP_591908
Protein GI: 94969860
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0677] UDP-N-acetyl-D-mannosaminuronate dehydrogenase
TIGRFAM ID: [TIGR03026] nucleotide sugar dehydrogenase
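
The coordinates and lengths in the record above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the last codon of the gene is a stop codon that is not counted in the protein length. The function names are illustrative only and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Codons in the gene minus one, assuming the final codon is a stop."""
    if gene_length % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length // 3 - 1


# Values taken from the record for Acid345_2833.
length = gene_length_bp(3370372, 3371718)
print(length)                     # 1347
print(protein_length_aa(length))  # 448
```

Both results agree with the Gene Length (1347 bp) and Protein Length (448 aa) fields.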


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGAAGTCGC AGCTCGGCAC GTTAGCCACG GAATTGAAAC GCAAAATCGA AGCTCGGGAA 
GCCCGCATCG GCATTGTCGG CATGGGATAC GTTGGATTGC CGCTTGCCCT GTTGTTTAGC
GAAGAGAAGT TTCGCGTAAC CGGGTTCGAC ATTGATAACC GGAAGGTCGA GGCACTGAAC
GCCGGCGGGT CGTATATCGT GCGGATTCCG GGCACCGAAA TCCAGGGCGC CCAGAAGAGC
GGATTCTCTG CCACCGCCGA TTACGGCAAG ATCTCCGAGA TGGACGCGGT GATCATCTGT
GTGCCGACGC CGCTCAACGA GTTCCACGAG CCGGACCTCA GCTACATCAC GCAAACCGTG
GATGCGATCG CACCTCGCCT GCGCGAAGGG CAGATCGTCA TCCTGGAGAG CACGACCTAT
CCCGGCACGA CCGAAGAGGT TGTGGTGCCA CTGCTCGAAA AGGGCAACGC CAAGGGATTG
AAGGTTGCGC GCGCCGAGGA CGAAGGCGAC TTCTTTGTTG CCTTCTCTCC GGAGCGCGAA
GATCCGGGCA ACGACACCGT GGCGCGTCGT GACATTCCCA AGGTTGTCGG CGGCGTTGGC
AAGCTTGCGT CTGAAATCGC GGCAGCCGTG TATGGCACGA TTTTCAACCG CACGGTACCG
GTCTCATCGC CAGCGGCAGC GGAAATGACC AAGCTGCTGG AAAACATCTA TCGCTGCGTG
AACATCGCGC TGGTCAACGA GTTGAAGCAG CTCTGCCACC GCATGGACAT TGATATTTTC
GAGGTCATCG ACGCAGCGAA GACCAAGCCC TTCGGCTTCC AGGCGTTCTA TCCGGGGCCA
GGTTTGGGCG GTCACTGCAT TCCGATCGAT CCGTTCTATC TCTCGTGGAA AGCGAAGCAG
TTCGACTTCC GCACCAAGTT CATCGAACTC GCCGGCGAAG TCAACATTGC AATGCCGTAT
TACGTGATTG ATAAGACCGT CGAGGCGCTG AACCAGCACA AGAAGTCGCT GAACGGTTCG
AAGGTCCTCG TGCTTGGACT TGCGTACAAG AAGGACATTG ACGACCTGCG CGAGTCACCC
TCGTTGACGA TCATCGAGCT GCTGCGCAAG GGCGGGGCCG AAGTTTTCTA CAACGATCCG
TTCTTCGCGA AGGTCGGACA CGGGCGCCAT TACGACCTGA ACATGACGAA CACACCGCTG
GAAAATCTTG GACAGTACGA CGCGGTGCTG ATCGTGACCG ACCACTCGGA TTACGACTAC
CAGCGCATCG TGAAAGAGTC GAAGCTGGTG GTGGATTCGC GCAACGCGAC AAAGGGGATC
ACGTCGGAAA AGATCGTTCG CTGCTAA
 
Protein sequence
MKSQLGTLAT ELKRKIEARE ARIGIVGMGY VGLPLALLFS EEKFRVTGFD IDNRKVEALN 
AGGSYIVRIP GTEIQGAQKS GFSATADYGK ISEMDAVIIC VPTPLNEFHE PDLSYITQTV
DAIAPRLREG QIVILESTTY PGTTEEVVVP LLEKGNAKGL KVARAEDEGD FFVAFSPERE
DPGNDTVARR DIPKVVGGVG KLASEIAAAV YGTIFNRTVP VSSPAAAEMT KLLENIYRCV
NIALVNELKQ LCHRMDIDIF EVIDAAKTKP FGFQAFYPGP GLGGHCIPID PFYLSWKAKQ
FDFRTKFIEL AGEVNIAMPY YVIDKTVEAL NQHKKSLNGS KVLVLGLAYK KDIDDLRESP
SLTIIELLRK GGAEVFYNDP FFAKVGHGRH YDLNMTNTPL ENLGQYDAVL IVTDHSDYDY
QRIVKESKLV VDSRNATKGI TSEKIVRC
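
The protein sequence corresponds to the gene sequence read codon by codon. As a minimal sketch, the Python below builds a codon table from the amino acid string that NCBI lists for translation table 11 (its codon-to-amino-acid assignments are the same as the standard code) and translates a DNA string after stripping the spaces between the 10-base blocks. It assumes the reading frame starts at the first base and stops at the first stop codon. It does not handle the alternative start codons of table 11, which is harmless here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon order: first base varies slowest, each base cycling through T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop spaces and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from this record.
first_line = "ATGAAGTCGC AGCTCGGCAC GTTAGCCACG GAATTGAAAC GCAAAATCGA AGCTCGGGAA"
print(translate(first_line))  # MKSQLGTLATELKRKIEARE
```

The output matches the first 20 residues of the protein sequence listed above (MKSQLGTLAT ELKRKIEARE).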