Gene TM1040_0845 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_0845 
Symbol 
ID4076020 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp895892 
End bp897502 
Gene Length1611 bp 
Protein Length536 aa 
Translation table11 
GC content64% 
IMG OID638006143 
Productglucose-methanol-choline oxidoreductase 
Protein accessionYP_612840 
Protein GI99080686 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2303] Choline dehydrogenase and related flavoproteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.189337 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTGACT TTGACTATAT CATTGTGGGA GCCGGATCCG CGGGCTGTGT GCTGGCCGAG 
CGCCTGAGCG CCAATGGCCG CCATAGTGTG CTTGTGCTGG AGGCCGGAGG TCGCCCGCGC
ACACCATGGA TCGCGTTGCC GCTTGGCTAC GGCAAGACCT TCTATGACCC GGCGGTGAAC
TGGAAATATC AGACTGAACC CGAAGAGACA CTGGGCGGAC GCGCCGGATA TTGGCCGCGT
GGCAAGGTCG TGGGGGGATC GGGTGCGATC AATGCGCTTG TCTACGCCCG TGGCCTGGCG
CGCGATTTCG ACGATTGGGA AGAGGCGGGC GCGACGGGCT GGAACTGGGA CGCGGTCCAG
AAAACCTATG AGCGCCTTGA GAGCCGCTTT GATGTCGATG GCACCCGCAC CGGCGAGGGG
CCGATTCACG TTCAGGATGT CTCGGACCAG ATCCACCGGG CCAACCGGCA TTTCTTTGCC
GCAGCGAAAG AGCTGGGTCT GCCACGGACA CCCGATATGA ACGGTATCAC CCCCGAAGGC
GCGGGCGTCT ACCGGATCAA CACCAGCGGT GGGCGCAGGA TGCATTCGGC GCGCGCCTGT
TTGGCTCCTG CGCTCCGGCG CGCAAATGTG ACGCTGATGA CGGGCGTTCT GGTGGAGCGG
ATCGGCTTTG AGGGAAAGCG GGCCACCTCC GTCGAGGTGG TCCACAAGGG GCGCGCGCAG
TCCTTGCAGG CCGGGCGAGA GATCATTCTC GCGGCAGGGG CTGTAAATTC ACCGCGCATC
TTGCAACTCT CGGGGCTTGG CCCCGCGGAG CTGCTGCGTG AGCATGGGAT CGCGCCGCTG
ATGGATGCGC CTCATGTAGG TGGCAACCTG CAGGATCATC TGGGCATAAA CTATTATTTC
CGTGCCACCG AACCCACGCT CAACAACGTG CTGAGGCCGC TCCATGGCAA GATCCGCGCA
GCGCTGCAAT ATGCGCTCAC GCGGCGCGGG CCGCTCGCGC TCTCGGTCAA CCAATGTGGT
GGATTTTTTC GCTCGGATGC GGGGCAGCGG GCGGCTGATC AGCAGCTTTA CTTCAACCCC
GTGACCTATA CCACCACACC GGACGGCAAA CGCACGGTGG TGCAGCCCGA CCCCTTTGCG
GGCTTTATCC TTGGGTTTCA GCCCACCCGG CCCATCAGCC GGGGGCGAAT CGACATTTCC
GCCGCCGACG CGCTTGCGCC GCCCCGGATC AGGCCGGACT CGCTGGCTGC TCAGGAAGAT
CAGGCGCAGG TGATCGCAGG CGGGCTGCTC TGTCAGAAGA TCGCCAAGAC CGAGGCGCTC
AGCCGCTTGA TCGCCGCGCC CATGGGCGAG GATCTGCGCG AGATGACACC GGAGCAGATC
CTAGCGGACT TTCGCGAGCG CTGCGGCACC GTGTTTCACC CGGTCGGCAC CTGTCGCATG
GGTGCAGACA GCACCAAGTC CGTGGTTTGC CCTCGGCTCA AGGTGCATGG GGTCGCGGGG
CTGCGGGTCG TTGATGCCTC GGTCTTCCCG AATATCACCT CGGGCAACAC CAACGCCCCA
ACCATGATGC TTGCCACCCG CGCGGCCGGT CTCATTCTGG AGGACGCATG A
 
Protein sequence
MRDFDYIIVG AGSAGCVLAE RLSANGRHSV LVLEAGGRPR TPWIALPLGY GKTFYDPAVN 
WKYQTEPEET LGGRAGYWPR GKVVGGSGAI NALVYARGLA RDFDDWEEAG ATGWNWDAVQ
KTYERLESRF DVDGTRTGEG PIHVQDVSDQ IHRANRHFFA AAKELGLPRT PDMNGITPEG
AGVYRINTSG GRRMHSARAC LAPALRRANV TLMTGVLVER IGFEGKRATS VEVVHKGRAQ
SLQAGREIIL AAGAVNSPRI LQLSGLGPAE LLREHGIAPL MDAPHVGGNL QDHLGINYYF
RATEPTLNNV LRPLHGKIRA ALQYALTRRG PLALSVNQCG GFFRSDAGQR AADQQLYFNP
VTYTTTPDGK RTVVQPDPFA GFILGFQPTR PISRGRIDIS AADALAPPRI RPDSLAAQED
QAQVIAGGLL CQKIAKTEAL SRLIAAPMGE DLREMTPEQI LADFRERCGT VFHPVGTCRM
GADSTKSVVC PRLKVHGVAG LRVVDASVFP NITSGNTNAP TMMLATRAAG LILEDA