Gene TM1040_3190 details

Gene Information

Locus tag: TM1040_3190
Symbol:
ID: 4075294
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008043
Strand:
Start bp: 181083
End bp: 183857
Gene Length: 2775 bp
Protein Length: 924 aa
Translation table: 11
GC content: 59%
IMG OID: 638004699
Product: formate dehydrogenase, alpha subunit
Protein accession: YP_611426
Protein GI: 99078168
COG category: [R] General function prediction only
COG ID: [COG3383] Uncharacterized anaerobic dehydrogenase
TIGRFAM ID: [TIGR01591] formate dehydrogenase, alpha subunit, archaeal-type
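
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption that agrees with the listed gene length) and that the CDS ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a complete CDS: number of codons minus one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(181083, 183857)
print(length)                  # 2775
print(protein_length(length))  # 924
```

Both values match the Gene Length and Protein Length fields of this record.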


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.523924
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCGATA AAATCACCTT TACGCTCGAC GGCGCGCAAG TCACCGCCGA CGCTGGCATG 
ACCATCTGGG AAGTCGCCAA TGGCCGCGGC CTCAAGATCC CGCATCTCTG CCACAAGCCG
CAGCCCGGCT ATCGCCCCGA CGGCAATTGC CGCGCGTGCA TGGTCGAGAT CGAGGGCGAG
CGCACGCTTG CGGCCTCCTG CATTCGCGAG CCCAGTGAGG GCATGGTTGT CACCACCAAC
AATGCACGCG CCGAAAACGC GCGAAAGATG GTGATGGAGC TGCTGGTTGC GGATCAGCCC
GCGCAGGAGG TTGCGCATGA TAAGTCCTCG CATATGTGGG ATATGGCAGA GTTGAACGGG
GTTTCGGAGT CGCGCTTTCC CAAGCTCGAA GAGGGGCACA TCCCGCTTTT GGATGACAGC
CATGTCGCCA TGAGCGTAAA CCTTGATGCC TGTATCTCTT GCGGGCTCTG CGTGCGCGCC
TGCCGTGAAG TGCAGGTGAA CGACGTGATT GGTATGGCGG GCCGGGGGCA CAATGCCTAT
CCGACCTTTG ATATTGCCGA CCCGATGGGC GAGTCCTCCT GCGTGGCCTG CGGCGAATGC
GTGCAGGCCT GCCCGACGGG CGCACTGATG CCCGCGACCG TGGTGGATGA AAACCAGGTG
GGCGATCGCA AGGACTTTGA TTCAGAAACC GAAAGCGTCT GCCCCTTCTG CGGGGTCGGC
TGCAAAGTCT CGCTCAAGGT GAAAGATGGT AAGGTCAAAT ATGTCGAGGG CATCAATGGC
CCCGCGAACG AAGGCCGTCT CTGCGTGAAA GGTCGTTTCG GCTTTGACTA TATCCACCAT
CCGCACCGCC TCACCAAACC ACTGATCCGT CGCGAGGATG CACCGGACAA AGGGCTGAAT
GTCGATCCCG GGAACCTGAT GACTCACTTC CGCGAGGCAA GCTGGGATGA GGCGATGGAT
CTGGCCGCGA AAGGTCTGAT CAAGCTGCGG GACGCAGACC CCAAATCTGT CGCAGGTTTT
GGGTCGGCAA AATGCACCAA TGAAGAAGCC TATCTCTTCC AGAAGTTTAT TCGTCAGGGC
TTCAAGCACA ACAACGTCGA TCACTGCACC CGCCTGTGTC ATGCCTCTTC TGTGGCGGCA
CTGATCGAGA ACGTGGGCTC GGGGGCTGTG ACCGCAACCT TCAACGAGAT CGAGAACGCG
GATGTAGCGA TCATCATCGG CGCCAACCCG ATTGAGAACC ACCCTGTTGC TGCGACCTAT
TTCAAGCAGT TCACCAAACG CGGTGGCAAG CTGATTGTGA TGGATCCGCG CGGTGTCGGC
ATGCGCCGAT ATGCGGACGA GATGCTCCAG TTCCGTCCAG GTGCCGATGT GTCGATGCTC
AATGCGATCA TGAACGTGAT CGTGGAAGAA GAGCTCTATG ACAGCCAGTA TATCCACCGC
TGGACCGAAA ACTGGGAGGC TGAAAAAGAG CACCTGCGCC AGTTCACGCC AGAAAAGATG
TCGGAAATCT GCGGCATCGA GCCAGAGCAG CTGCGCCGTG TGGCCCGGAC CTTTGCAGGT
GCCGAGGCGG GGATGATCTT CTGGGGCATG GGCGTCAGCC AGCACATTCA CGGCACTGAC
AACTCGCGGT GCCTGATCTC GCTGGCCTTG ATGACAGGCA ATGTCGGTAA ACCCGGTGCG
GGCCTGCACC CCTTGCGGGG TCAGAACAAT GTGCAGGGTG CGTCGGATGC GGGTCTCATT
CCGATGTTCC TGCCGGACTA CCAGACTGTC ACCTCTGACG ATGTCCGTCG CAGCTTTACA
GATGTCTGGG GTGGGGGCGA TTTCTCCAAT GAGAAGGGCC TCACCGTGAC AGAAATCGTG
GATCAGGTTT ATGCGGGCAA TATCAAAGGC ATGTACATTC AGGGCGAGAA CCCTGCCATG
TCCGACCCTG ACGCCGATCA CGCGCGCGAG GCCTTTGCCA AGCTCGACCT GATGATCGTG
CAGGATATTT TCCTGACCGA GACGGCGAAT TTCGCCGACA TCATCCTGCC CGCCTCGACC
CTTTATGAAA AGAACGGCAC GGTGTCCAAT ACCAACCGTC AGGTGCAGCG CGTGCGGCCT
GCCGTCACCC CTCCGGGTGA GGCGCGCGAG GATTGGAAGA TCACCGTAGA ATTGGCACAG
CGGATCGGTT TGCCTTGGGC CTATAGTGAT GTTTCAGAGG TTTTTGCCGA GATGAAACTC
AATATGAAGT CGCTCGACAA TATCACCTGG GAACGTCTGG AGGTGGAGAC CATCACCTAT
CCCTCGCTGC ATGAGACCGA CCCGGGTCAG GCGATTGTGT TTGGCGATGG CTTCCCGCGC
CCCGAGGGGC GGGCGAGATT TACGCCTGCC TCGGTGATCC CGCCGGATGA GGCCCCGGAT
GCGGACTTCC CGATGATCAT GACCACGGGG CGTCAGCTCG AACATTGGCA TACGGGCTCC
ATGACGCGGC GCTCGCTTGT GTTGGATGCG GTTGAGCCGG AGGCAAACTG TTCGTTGCAT
CCGCGTACCC TGCGCACCCT TGGGGTTGAG CCGGGCGAGA TGGTTCGACT GTCCACGCGT
CGCGGCTCGA TCGAGATCAT GGCTCGTGCG GACCGCGCGG TGGCCGAAGA CATGGTCTTT
GTGCCCTTTG CCTATGTCGA GGCAGCGGCC AATATCCTGA CCAACCCGGC AATCGATCCC
TACGGCAAGA TCCCCGAGTT CAAGTTCTCG GCTGTGCGGG TTGAGAAGGC AGAAGGGCAG
ATCGCAGCCG AGTGA
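
The gene sequence is displayed in space-separated blocks of 10 bases, so the whitespace has to be stripped before computing anything from it. The following is a minimal sketch of how the GC content field could be recomputed from pasted sequence text. The helper names are hypothetical, and only the first displayed line is used as sample input, so the result will not necessarily equal the 59% reported for the full gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(text: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First displayed line of the gene sequence, used here as sample input.
first_line = "ATGTCCGATA AAATCACCTT TACGCTCGAC GGCGCGCAAG TCACCGCCGA CGCTGGCATG"
print(len(clean_sequence(first_line)))
print(round(gc_percent(first_line), 1))
```

Passing the full 2775 bp sequence to `gc_percent` in the same way gives the value to compare against the GC content field.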
 
Protein sequence
MSDKITFTLD GAQVTADAGM TIWEVANGRG LKIPHLCHKP QPGYRPDGNC RACMVEIEGE 
RTLAASCIRE PSEGMVVTTN NARAENARKM VMELLVADQP AQEVAHDKSS HMWDMAELNG
VSESRFPKLE EGHIPLLDDS HVAMSVNLDA CISCGLCVRA CREVQVNDVI GMAGRGHNAY
PTFDIADPMG ESSCVACGEC VQACPTGALM PATVVDENQV GDRKDFDSET ESVCPFCGVG
CKVSLKVKDG KVKYVEGING PANEGRLCVK GRFGFDYIHH PHRLTKPLIR REDAPDKGLN
VDPGNLMTHF REASWDEAMD LAAKGLIKLR DADPKSVAGF GSAKCTNEEA YLFQKFIRQG
FKHNNVDHCT RLCHASSVAA LIENVGSGAV TATFNEIENA DVAIIIGANP IENHPVAATY
FKQFTKRGGK LIVMDPRGVG MRRYADEMLQ FRPGADVSML NAIMNVIVEE ELYDSQYIHR
WTENWEAEKE HLRQFTPEKM SEICGIEPEQ LRRVARTFAG AEAGMIFWGM GVSQHIHGTD
NSRCLISLAL MTGNVGKPGA GLHPLRGQNN VQGASDAGLI PMFLPDYQTV TSDDVRRSFT
DVWGGGDFSN EKGLTVTEIV DQVYAGNIKG MYIQGENPAM SDPDADHARE AFAKLDLMIV
QDIFLTETAN FADIILPAST LYEKNGTVSN TNRQVQRVRP AVTPPGEARE DWKITVELAQ
RIGLPWAYSD VSEVFAEMKL NMKSLDNITW ERLEVETITY PSLHETDPGQ AIVFGDGFPR
PEGRARFTPA SVIPPDEAPD ADFPMIMTTG RQLEHWHTGS MTRRSLVLDA VEPEANCSLH
PRTLRTLGVE PGEMVRLSTR RGSIEIMARA DRAVAEDMVF VPFAYVEAAA NILTNPAIDP
YGKIPEFKFS AVRVEKAEGQ IAAE
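
The protein sequence can be checked against the gene sequence by translating the codons. The following is a minimal sketch, assuming the elongation codon assignments of translation table 11 are the same as those of the standard genetic code (table 11 differs in which start codons it permits, which does not matter here because this gene begins with ATG). The `translate` helper is hypothetical and stops at the first stop codon.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code; '*' = stop).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(text: str) -> str:
    """Translate a block-formatted DNA sequence, stopping at the first stop codon."""
    seq = "".join(text.split()).upper()
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases and last 15 bases of the gene sequence shown above.
print(translate("ATGTCCGATA AAATCACCTT TACGCTCGAC"))  # MSDKITFTLD
print(translate("ATCGCAGCCG AGTGA"))                  # IAAE
```

The two fragments reproduce the first ten and last four residues of the listed protein sequence. The same function applied to the full gene sequence should yield all 924 residues.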