Gene TM1040_1882 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_1882 
Symbol 
ID4077379 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp1980763 
End bp1982418 
Gene Length1656 bp 
Protein Length551 aa 
Translation table11 
GC content63% 
IMG OID638007198 
Productcholine dehydrogenase 
Protein accessionYP_613877 
Protein GI99081723 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2303] Choline dehydrogenase and related flavoproteins 
TIGRFAM ID[TIGR01810] choline dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGCGG ATTATGTGAT TGTTGGGGCG GGCAGCGCAG GCTGCGCCAT GGCCTACCGT 
CTGAGCGAGG CGGGCAAATC GGTGCTGGTG ATCGAGCATG GTGGCACGGA TGCGGGCCCC
TTCATCCAGA TGCCCGGCGC ATTGAGCTAC CCCATGAACA TGTCGATGTA TGACTGGGGT
TACAAATCCC AGCCCGAGCC GCATCTGGGC GGGCGCGAGC TGGTGACGCC GCGCGGCAAG
GTCATTGGCG GATCCTCTTC GATCAACGGC ATGGTCTATG TGCGCGGCCA CGCGGGCGAC
TATAACCACT GGGCCGAAAC GGGCGCGACC GGCTGGTCCT ATGCCGACGT GCTGCCCTAT
TTCAAACGTA TGGAAACCTG GGATGATCGC GGCCATGGCG GCGATCCCGA CTGGCGCGGC
ACCGACGGCC CGCTGCATGT CACCCGTGGC CCCCGCGACA ACCCGCTACA TGATGCCTTT
GTGAAGTCCG GGCAGCAGGC GGGGTATCCG GTCACCAAGG ATTATAACGG CCAGCAGCAA
GAGGGCTTTG GCCCGATGGA GATGACCGTC CACAAGGGCC GCCGCTGGTC TGCCGCCAAT
GCCTATCTGA AACCTGCGCT CAAGCGCGAC AATTGCGATC TGATCCGCGC GCTGGCCCGC
AAGGTGGTGA TCGAGGATGG CCGCGCCGTC GGTGTCGAAG TCGAGCGCGG CGGCAAGATC
GAAGTCATCC GCGCCAATAT CGAGGTGATC CTCGCGGCGT CTTCGCTCAA CTCGCCCAAG
CTCCTGATGC TCTCGGGCAT TGGCCCCGCC GCACATCTGG CCGAACATGG CATCGACGTC
ATCGCGGACC GGCCCGGCGT TGGCCAGAAC CTGCAGGACC ATCTGGAGTT CTATTTCCAG
TTTGCCTCCA AGAAGCCGAT CACGCTCTAT AAATACTGGA ACCTCTTCGG CAAGGCCTTG
GTCGGGGCGC AGTGGCTCTT TACCAAGACC GGGCTCGGGG CCTCGAACCA GTTCGAGAGC
GCGGCCTTCA TTCGCTCGGA CAAGGGGATC GACTATCCCG ACATCCAGTA TCACTTCCTG
CCGATCGCCG TGCGCTATGA CGGGCAGGCG GCGGCCGAGG GCCACGGCTT TCAGGCCCAT
GTCGGCCCGA TGCGCTCGCA GTCGCGCGGC GAGGTAACGC TGGCCAGCGC CGATCCCAAC
GCCGCGCCAA AGATCCTGTT CAACTACATG TCTACCGAGC AGGACTGGAT CGATTTCCGC
AAATGCGTCC GCCTCACGCG TGAGATCTTT GCACAGGATG CGATGAAGCC TTTTGTGAAA
CACGAGATCC AGCCGGGCAC CGACCTGCAA ACGGACGAGG AGATCGACGG ATTCCTGCGC
GAACATGTCG AGAGCGCCTA TCACCCCTGC GGCACCTGCA AGATGGGTGC GGTGGATGAT
CCGATGGCGG TGGTTGACCC CGAATGCCGG GTGATTGGCG TCGAGGGGCT GCGGGTGGCG
GATAGTTCGA TCTTCCCGCG CATCACCAAC GGCAACCTCA ACGGGCCCTC GATCATGACC
GGCGAGAAAG CCTCCGATCA CATTCTGGGG CGCCGTCTGC CTTCGTCGAA TGCCGAGCCG
TGGTTCAACC CGAACTGGCA GACCTCGCAG CGTTGA
 
Protein sequence
MNADYVIVGA GSAGCAMAYR LSEAGKSVLV IEHGGTDAGP FIQMPGALSY PMNMSMYDWG 
YKSQPEPHLG GRELVTPRGK VIGGSSSING MVYVRGHAGD YNHWAETGAT GWSYADVLPY
FKRMETWDDR GHGGDPDWRG TDGPLHVTRG PRDNPLHDAF VKSGQQAGYP VTKDYNGQQQ
EGFGPMEMTV HKGRRWSAAN AYLKPALKRD NCDLIRALAR KVVIEDGRAV GVEVERGGKI
EVIRANIEVI LAASSLNSPK LLMLSGIGPA AHLAEHGIDV IADRPGVGQN LQDHLEFYFQ
FASKKPITLY KYWNLFGKAL VGAQWLFTKT GLGASNQFES AAFIRSDKGI DYPDIQYHFL
PIAVRYDGQA AAEGHGFQAH VGPMRSQSRG EVTLASADPN AAPKILFNYM STEQDWIDFR
KCVRLTREIF AQDAMKPFVK HEIQPGTDLQ TDEEIDGFLR EHVESAYHPC GTCKMGAVDD
PMAVVDPECR VIGVEGLRVA DSSIFPRITN GNLNGPSIMT GEKASDHILG RRLPSSNAEP
WFNPNWQTSQ R