Gene Acid345_2658 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_2658 
Symbol 
ID4071912 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp3131906 
End bp3133567 
Gene Length1662 bp 
Protein Length553 aa 
Translation table11 
GC content61% 
IMG OID637984675 
Productglucose-methanol-choline oxidoreductase 
Protein accessionYP_591733 
Protein GI94969685 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2303] Choline dehydrogenase and related flavoproteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAAGGGAC CACAGTTTCG AAACTCTGAA ACCGTTGACT TCATCGTGGT AGGCGCGGGA 
GCGGCCGGCG GTGTTATGGC GAAGGAGCTG GCCGTCGCGG GCTTCAGCCT GGTGGTGCTG
GAGCAGGGGC CGTATCTGAG CGAGGAGGAC TTCGGACACG ACGAGATCAA GTTCGCCATC
CAGAAGAAAC TGACCAACGA TACGAAGATC CAGCCCATCA CCTATCGCAA GACCGAAGCC
GAGGTTGCGA AGCCGTTCAA GGCCATTGAA TACGGGCGGC AGGTGGGCGG CGGGTCGGTA
CACTTCACGG CGAATTACTG GCGTCTGCAT GAGAGCGATT TCCACGAGCG CAGTCTTTGG
GGCGAGGTGC AAGGCTCGAC GTTCGACGAC TGGCCGATTC GCTACGGCGA CCTGGAACCC
TATTACACCA AGGCGGAAGA GGAGCTGGGC ATCTCGGGCC TGGGTGGGGC GAACCCGTTC
GAGGCGCCGC GGTCGAAGCC GTATCCGCTG CCGCCGATGC CGGTCAAGTC CTCGGGTGTG
CTGTTCGAGC GGGCGACGAA GAAGATGGGG CTGCATCCGT ATCCTGCGCC GGTGGCGGTA
CTCTCGCAGC CGTATCGCGG AAGCGGCGCG TGCGTGCACT GCGGCATGTG TGAACTCTTT
GGCTGCGAGA TGAAAGCGAA GTCGAGCACG CTGGTCAGCG TCATTCCGAT TGCGGAGAAG
AGCGGACGCT GCGAGATCCG CCCCAATTCC TACGTGCGCA AGTTGGAAAC CGACGCCTCC
GGCCGCGTAA CCGGGGTGAT CTATTTCGAC GCGCAGAAGC AAGAGGTGCT GCAGCGCGCC
AAGGCGGTGG TGCTTTGCGC CAACGGTGTG GAGTCGGCGA AGCTGCTGCT GATGTCGAAG
TCGAACCGGT TCCCGCAGGG GCTCGCGAAT TCAAGCGGGC TGGTGGGCAA GAACCTGATG
TGGGACAACG GCACGGAATC GAGCGGGCTG TTTGAACACC CGCTGAACGA ATTCAAGAGC
GTGCAGGTCA CGCGCGTGAT TCACGATTAC TACGATGCCG ATCGAAAGCG AGGCTTTTAC
GGCGGCGGAG GCATCGACGC GCGCTTCGAT TTCTATCCCA TCACCTTCGC GCTTACCGGG
CTGCCGGACG ATGCTCCGAC CTGGGGACTG GAATTCAAGA AAACGGTTGG CAAGTACTTC
ACGCGTACCA TGACTCTGCT CGCGCATGCC ACTTCGCTGC CGCGCGAAAC GAACAGTGTC
TCGCTCGACC CGCAGATGAA AGATGCATGG GGATTGCCGG CGGTGCGCAT CACCTTCGAC
TGGCATCCGG ATGATATCGC GAACATGAAG TGGCTCGTCG AAAGGGAACG CGAGATTTTG
CAGGTAGCGG GAGCACAGAA AGTATGGTCG TTCCCGGTGG AACCGGCGCA GCCGAACCTG
ATGCCGTCTC GGCACCTGAT TGGAACTTGC CGCATGGGAC GCGACCCGAA AAAATCAGTC
GTGGATCCCT TCGGCCACGC CCACGATGTG CCGAACCTGT TCATCGTGGA CGGAAGCAAC
TTCGTGACCT CAGGACGACA GCAACCAACG GCGACGATCC AGGCGCTGGC TTATCGGGCG
GCAGAACGGA TTGCGGGGAA GGCTAAGGCG GGGGAGTTGT AG
 
Protein sequence
MKGPQFRNSE TVDFIVVGAG AAGGVMAKEL AVAGFSLVVL EQGPYLSEED FGHDEIKFAI 
QKKLTNDTKI QPITYRKTEA EVAKPFKAIE YGRQVGGGSV HFTANYWRLH ESDFHERSLW
GEVQGSTFDD WPIRYGDLEP YYTKAEEELG ISGLGGANPF EAPRSKPYPL PPMPVKSSGV
LFERATKKMG LHPYPAPVAV LSQPYRGSGA CVHCGMCELF GCEMKAKSST LVSVIPIAEK
SGRCEIRPNS YVRKLETDAS GRVTGVIYFD AQKQEVLQRA KAVVLCANGV ESAKLLLMSK
SNRFPQGLAN SSGLVGKNLM WDNGTESSGL FEHPLNEFKS VQVTRVIHDY YDADRKRGFY
GGGGIDARFD FYPITFALTG LPDDAPTWGL EFKKTVGKYF TRTMTLLAHA TSLPRETNSV
SLDPQMKDAW GLPAVRITFD WHPDDIANMK WLVEREREIL QVAGAQKVWS FPVEPAQPNL
MPSRHLIGTC RMGRDPKKSV VDPFGHAHDV PNLFIVDGSN FVTSGRQQPT ATIQALAYRA
AERIAGKAKA GEL