Gene Acid345_0361 details

Gene Information

Locus tag: Acid345_0361
Symbol:
ID: 4069603
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 400022
End bp: 401692
Gene Length: 1671 bp
Protein Length: 556 aa
Translation table: 11
GC content: 59%
IMG OID: 637982364
Product: glucose-methanol-choline oxidoreductase
Protein accession: YP_589440
Protein GI: 94967392
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2303] Choline dehydrogenase and related flavoproteins
TIGRFAM ID:
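
The start and end coordinates, gene length, and protein length listed above agree with one another if the coordinates are 1-based and inclusive and the final codon is a stop codon that is not counted in the protein length. Both are assumptions here, not statements from the record. A minimal Python sketch of that check, using illustrative function names that do not belong to any database API:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is assumed to be a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(400022, 401692)
print(length)                  # 1671
print(protein_length(length))  # 556
```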


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.252753
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGCAAGATG ACCAGCGGCA GTTTGATTTC GACTTCATCG TCATCGGCTC AGGATTTGGC 
GGAAGTGTCT CCGCGCTGCG ACTCACGGAA AAGGGCTACA AGGTCGCAGT GATGGAGATG
GGGCGTCGCT GGACGCCCGA CAACCTTCCG AAGACCAATT GGTCGCTCGC GCGCTGGTTC
TGGCGACCGG GGCTCGGGCT GCGCGGCTTC TTCAGCATGA GGTTTTTCAG TCGCGTCACA
ATCTTGCATG GATGCGCGGT GGGCGGCGGC TCCATCACCT ATGCCAGCAC GCTGCTGCGT
GCGCCGGATA AAGTATGGGA CAGCGGCACC TGGAAGGGAC TGTCGAATTG GAAGTCGGAG
ATGCCGCGCC ACTACGAGAC GGCGTCGCGC ATGCTCGGTG TTACTCAGAA CAAGATCCTC
GGGCCCGCCG ATCATCTGCT GAAGCAAGTT GCCGTCGCCT CCGGAGCAGG CGAGACGTTT
TACCGCACCA ACGTCGGCAT TTTCCAGGCG CCCGAAGGCG AAGCTGGTGG ACTGACCTAT
GCCGATCCGT ACTTCGGTGG CGAAGGTCCA GCGCGCACCA CCTGCAACGC CTGTGGTGGC
TGCATGATCG GTTGCCGTCA CGGCGCGAAG AACACGCTCG ATCTTAACTA TCTCTACCTC
GCTGAGAAGC GCGGTATGAA GATCTTCGCG GAGACGCGTG TGGTGGACGT TCAGCCGCTT
GGCGCGGTGG ATGGCAGAGA AGGGTACGAA GTCACTACTG AACGCTCGAC CTCATTCGTG
TTCAAGAACC GGCAACGCTT CACGTGTCGG GGTATTGTGT TCTCGGCATC GTCACTCGGT
ACGACGGAGC TTCTCTTCCG CCTGAAGACG AAGCATTCGC TGCCAAACAT CAGCGATCAG
CTCGGCAATC GTGTGCGTAC GAATAGCGAA TCACTCATCG GAGTGCGGGT GCCGAAATCC
GAGCAAGATC TTTCTCGCGG GGTTGCGATC GGTTCGGGCG TTTACATCGA CGACCACACG
CACATTGAAG CCGTGCGTTA TCCCAAAGGT TCCGATGTCA TGGGCGGCCT TGCAACCACT
CTTACTGCGG GCAAGCCTGG CATTGGACGC ATTGCGCTCT GGTTCAAGAA CTTGCTGGTC
TCATTCTGCA CGCATCCGGT GCGCACCGTT CGACTGCTTC AGCCCTTCGG TTTCGCGCGC
GAATCCGTCA TCCTGCTCTG CATGCAGGCG CTGGAGGGAC ACATTGATAT GCGGTGGAAA
CGTCCCTGGT ATTGGCCTGT TCGCCGCGTG CTCGTCAGCA GCGGACAGCG CATCCCAACC
TTCATTCCTG CGGCCAATCA GTTCGCGCAG GTATTCGCCA AGATGGCGGG TGGCACCGCG
ATGAGTATGT TGCCGGAGAT TCTCTTCAAT ATTCCCGCGA CCGCCCATTG CCTCGGTGGC
GCTGTAATCG GCGCATCGCC GGTGGACGGC GTGATTGACG CGCGGCACCG CGTCTTTGGC
TACACCAATA TGTATGTTTG CGATGGCTCT GTCGTTGCCG CAAACCTCGG CGTCAATCCC
AGCCTGACGA TTACGGCGTT GGCGGAGCGC GCGATGGAGT TCATTCCACT CGCGAGCGCG
CATACGTGGA CCGATCGCGC TGATTCCATC GAAGTCTCTA AAGCTGTCTA G
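
The gene sequence above is printed in space-separated blocks of ten bases, so the spacing has to be removed before it can be analyzed. The sketch below is a hedged illustration of how to join the blocks and compute a GC percentage to compare with the reported GC content. The helper names are hypothetical, and only the first line of the sequence is used for the demonstration.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, used only as a short demonstration input.
first_line = "ATGCAAGATG ACCAGCGGCA GTTTGATTTC GACTTCATCG TCATCGGCTC AGGATTTGGC"
seq = clean_sequence(first_line)
print(len(seq))                   # 60
print(round(gc_percent(seq), 1))  # GC percentage of this line only
```

Passing all of the sequence lines to `clean_sequence` as one string gives the full-gene GC percentage.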
 
Protein sequence
MQDDQRQFDF DFIVIGSGFG GSVSALRLTE KGYKVAVMEM GRRWTPDNLP KTNWSLARWF 
WRPGLGLRGF FSMRFFSRVT ILHGCAVGGG SITYASTLLR APDKVWDSGT WKGLSNWKSE
MPRHYETASR MLGVTQNKIL GPADHLLKQV AVASGAGETF YRTNVGIFQA PEGEAGGLTY
ADPYFGGEGP ARTTCNACGG CMIGCRHGAK NTLDLNYLYL AEKRGMKIFA ETRVVDVQPL
GAVDGREGYE VTTERSTSFV FKNRQRFTCR GIVFSASSLG TTELLFRLKT KHSLPNISDQ
LGNRVRTNSE SLIGVRVPKS EQDLSRGVAI GSGVYIDDHT HIEAVRYPKG SDVMGGLATT
LTAGKPGIGR IALWFKNLLV SFCTHPVRTV RLLQPFGFAR ESVILLCMQA LEGHIDMRWK
RPWYWPVRRV LVSSGQRIPT FIPAANQFAQ VFAKMAGGTA MSMLPEILFN IPATAHCLGG
AVIGASPVDG VIDARHRVFG YTNMYVCDGS VVAANLGVNP SLTITALAER AMEFIPLASA
HTWTDRADSI EVSKAV