Gene TM1040_0401 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_0401 
Symbol 
ID4078795 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp408686 
End bp411580 
Gene Length2895 bp 
Protein Length964 aa 
Translation table11 
GC content61% 
IMG OID638005696 
Productformate dehydrogenase alpha subunit 
Protein accessionYP_612396 
Protein GI99080242 
COG category[C] Energy production and conversion 
COG ID[COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.298134 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTTAGGA AAAAGACCAA CGGGGTTGCG CGACGCCCCC AGCGGACCAG TATCCTGTCC 
AAGGTCGGTG AGTCAGCCGT CGACCGCCGC GCGTTCCTGC GCGGATCTGG CCTTGCCATC
GGCGGGCTTG CTGCCATCAG CGCAACCGGT GGTACAGTGA CGCAAGCCAA TGCGGCGGCG
TCTGCAACCG GCGCGATGGA AACGATCAAA TCCGTCTGCA CCCACTGCTC GGTCGGCTGT
ACCGTGGTGG CAGAGGTGCA AAACGGTGTC TGGGTTGGCC AGGAGCCAGG CTGGGACAGC
CCCTTCAACC TCGGTGCGCA TTGCGCCAAG GGCGCTTCGG TGCGTGAACA CGCTCATGGT
GAGCGCCGCC TGAAGTACCC GATGAAGAAA GAAGGCGGTG AGTGGAAGCG CATCAGCTGG
GAGCAGGCCA TCGACGAGAT CGGCGACGGC ATGATGCAGA TCCGCGAAGA AAGCGGCCCT
GATAGCGTCT ACTGGCTCGG TTCTGCCAAG CACAACAACG AACAGGCCTA CCTGTTCCGC
AAGTTCGCCG CCTACTGGGG TACGAACAAC GTGGATCACC AGGCCCGGAT CTGTCACTCC
ACCACGGTTG CGGGTGTTGC GAATACATGG GGCTACGGCG CCATGACCAA CAGCTACAAC
GACATCCATA AATCCAAGGC GATCTTTATC ATCGGTGGCA ACCCCGCCGA GGCGCATCCG
GTATCGCTGC TGCATGTGCT GAAGGCCAAG GAAGAGAACA ACGCGCCGCT GATCGTCTGC
GATCCGCGTT TCACGCGTAC GGCGGCCCAT GCGGATGAAT ATGTCCGCTT CCGTCCCGGC
ACCGACGTGG CGCTCGTTTG GGGCATCCTG TGGCATATCT TTGAAAACGG TTGGGAAGAC
ACCGAGTTCA TCCGCACCCG TGTCTGGGGT ATGGATCAGA TCCGGACCGA AGTGGCCAAA
TGGACGCCCG AAGAGGTCGA ACGCGTCACC GGCACCCCGG GTAGCCAGCT CAAGCGCGTT
GCGCGCACCC TGGTCAACAA CCGCCCCGGC ACCGTCATCT GGTGTATGGG TGGCACCCAG
CACACCAATG GCAACAACAA CACCCGCGCC TACTGCATCC TGCAGCTGGC CCTTGGCAAC
ATGGGTGTGT CCGGCGGTGG CACCAACATC TTCCGCGGCC ACGACAACGT GCAGGGGGCA
ACCGACCTTG GCGTTCTGAG CCACACTCTG CCGGGCTATT ATGGTCTGTC GGCTGGCGCA
TGGGGCCATT GGGGCCGCGT CTGGGGCGAA GACATGGACT GGCTGAAGGG TCAGTTTGAA
ACCGTCAAAG GCGCCGACGG CAAGGATAAG AACCTGATGA ACCTGACGGG CATTCCGGTG
TCCCGCTGGA TCGACGGTAT CCTTGAAGAC AAGGAAAACA TGGACCAGCC CAACAATGTT
CGGGCCATGG TTCTCTGGGG CCACGCGCCG AACTCTCAGA CCCGGATGAC GGAGATGAAG
ACGGCGATGG AGAAACTCGA CATGCTTGTC GTGGTTGACC CCTATCCGAC CGTCTCTGCC
GTGCTGCATG ATCGCACCGA TGGTGTCTAT CTGCTGCCCG CCTGCACCCA GTTTGAGACC
CGCGGCTCCG TGACGGCCTC GAACCGTTCG CTGCAGTGGC GCGATCAGGT GGTGGAGCCT
CTCTTTGAGA GTCTGCCGGA TCACGTCATC ATGGCCAAAT TCGCCAATAA GTTCGGCTGG
GCAGATCGTC TCTTCCGCAA TATCGAAATG GAAGACGCCG AGACCCCCAA CATCGAAAGC
ATCACCCGTG AGTTCAATGC GGGCATGTGG ACAGTGGGCT ACACGGGTCA GAGCCCGGAG
CGCATCAAGC TCCATATGGC CAATCAGCAC ACCTTTGATC GCACGACGCT TCAGGCCGTT
GGTGGCCCGG CGGATGGCGA TTACTACGGG ATGCCATGGC CCTGCTGGGG CACGCCGGAA
ATGAAGCATC CGGGCACGCC GAACCTCTAT GACATGTCCA AACCTGTCGC CGAAGGTGGT
CTGTGTTTCC GCGCCCGTTT CGGGGTGGAG CGTGATGGCG AAAATCTCCT GGCAGAAGGG
GTCTCCAACC CCGGTGCGGA GATTCAGGAT GGCTATCCTG AGTTCACCAT GCAGATGCTG
ATGGATCTGG GCTGGGATGG CGACCTGACG GCCGAGGAAC GCGCGGCGAT CGACGCCGTT
GCCGGGCCAA AGACCAACTG GAAAACCGAC CTCTCCGGTG GGATTCAGCG GGTTGCGATC
AAGCACGGCT GCGCGCCCTT CGGGAACGCC AAGGCTCGTG CGGTTGTGTG GACCTTCCCG
GATCCGGTGC CGCTGCACCG CGAGCCGCTC TACACCAACC GGCGTGACCT GGTGGCGGAT
TATCCGACCT ATGAGGATCG GAAATTCTAT CGTCTGCCCA CCATGTATGC CTCGATCCAG
AAGAACGATG TCTCCAAGGA GTATCCGATC ATCCTCACCT CCGGCCGTCT GGTCGAATAT
GAGGGCGGCG GTGACGAGAC CCGTTCGAAC CCGTGGCTTG CAGAACTGCA GCAGGACATG
TTCGTCGAGA TCAATCCGCG CGATGCCAAT GACATCGGAA TCCGCGATGG GTCTCAGGTC
TGGGTCGAAG GCCCGGAAGG CGGCAAGGTC AAGGTGATGG CAATGGTGAC AGAACGCGTC
GGGGCCGGTG TGGCCTTCAT GCCGTTCCAC TTTGGCGGGC ACTTCCAAGG TAAGGATCTG
AGGGATAAAT ATCCCGACGG GGCCGACCCT TACGTGCTGG GTGAAAGTAC CAACACCGCG
CAGACCTACG GCTATGACTC TGTCACGCAG ATGCAAGAGA CCAAAGCCAC CCTCTGCAAA
ATCTCAGCAG CCTAA
 
Protein sequence
MLRKKTNGVA RRPQRTSILS KVGESAVDRR AFLRGSGLAI GGLAAISATG GTVTQANAAA 
SATGAMETIK SVCTHCSVGC TVVAEVQNGV WVGQEPGWDS PFNLGAHCAK GASVREHAHG
ERRLKYPMKK EGGEWKRISW EQAIDEIGDG MMQIREESGP DSVYWLGSAK HNNEQAYLFR
KFAAYWGTNN VDHQARICHS TTVAGVANTW GYGAMTNSYN DIHKSKAIFI IGGNPAEAHP
VSLLHVLKAK EENNAPLIVC DPRFTRTAAH ADEYVRFRPG TDVALVWGIL WHIFENGWED
TEFIRTRVWG MDQIRTEVAK WTPEEVERVT GTPGSQLKRV ARTLVNNRPG TVIWCMGGTQ
HTNGNNNTRA YCILQLALGN MGVSGGGTNI FRGHDNVQGA TDLGVLSHTL PGYYGLSAGA
WGHWGRVWGE DMDWLKGQFE TVKGADGKDK NLMNLTGIPV SRWIDGILED KENMDQPNNV
RAMVLWGHAP NSQTRMTEMK TAMEKLDMLV VVDPYPTVSA VLHDRTDGVY LLPACTQFET
RGSVTASNRS LQWRDQVVEP LFESLPDHVI MAKFANKFGW ADRLFRNIEM EDAETPNIES
ITREFNAGMW TVGYTGQSPE RIKLHMANQH TFDRTTLQAV GGPADGDYYG MPWPCWGTPE
MKHPGTPNLY DMSKPVAEGG LCFRARFGVE RDGENLLAEG VSNPGAEIQD GYPEFTMQML
MDLGWDGDLT AEERAAIDAV AGPKTNWKTD LSGGIQRVAI KHGCAPFGNA KARAVVWTFP
DPVPLHREPL YTNRRDLVAD YPTYEDRKFY RLPTMYASIQ KNDVSKEYPI ILTSGRLVEY
EGGGDETRSN PWLAELQQDM FVEINPRDAN DIGIRDGSQV WVEGPEGGKV KVMAMVTERV
GAGVAFMPFH FGGHFQGKDL RDKYPDGADP YVLGESTNTA QTYGYDSVTQ MQETKATLCK
ISAA