Gene TM1040_2168 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_2168 
Symbol 
ID4076767 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp2279428 
End bp2281509 
Gene Length2082 bp 
Protein Length693 aa 
Translation table11 
GC content62% 
IMG OID638007490 
Productmolydopterin dinucleotide-binding region 
Protein accessionYP_614162 
Protein GI99082008 
COG category[C] Energy production and conversion 
COG ID[COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCAGAC AAGCCAGATC CTCTCAAGAC ACGCGTGTCG TGCCCAGCGT TTGCCCGTTG 
GACTGCCCGG ACACCTGTAG CCTCAGCGTC GAGGTAACGG GCGATCAGAT TACTGCCGTA
CGCGGGTCAA ATGCCAATCC GTTTACGGCC GGTGTGGTCT GCAACAAGGT CACGCGCGCC
TACCCTGACT TTGTACATGG TCCGGCGCGG CTCACACATC CGCTGCGGCG GGTGGGGCCG
CGGGGGGCTG GCACGTTTGA ACGCATCAGT TGGGAGGATG CGCTGGATCT TGTGGTCGAA
GGGTTTCACG CTGCCATTGA TGCGCATGGC GCGCAGTCGA TCTTGCCGCT GAACTATGCT
GGACCTCACG GGGAGCTGGC GGGGGGCTCG CTCGATCGGC GATTTTTCTA CCGTCTGGGC
GCCACGCGAA TGGATCGCGG CCCGCTCTGC GGCGGCGTGC GCAGCCTCGC CTATGAGAGC
CTCTATGGCA ATGCTCCCGG CATGCCGCCG GAACAGGCCC AGCATGCCGA TCTCATCGTG
GTCTGGGGCA ACAATGTGAC GGTCTCCAAC CTGCATCTGA CACGGATCGT CAAAGCGGCC
CGCGCCAAGG GTGCGCGTCT GGTGGTGATT GACCCCAAGC GGATCAAGAT CGCAGAGCAG
GCCGACCTCT ATCTTCAGGT GCGACCCGGA ACCGATGTGG TGCTTGCCCA TGCGCTTGCG
GCTGAACTCG AACGGCGCGG CGCATTTGAT GCGGAGTTTA TTGCGCATTG GGTGCATGGC
GTTGACGAGT ATATGGACGA GGCGCGCAAG CACAGCGTTG AAAACGCTTC TGAAATCTGC
GGCCTCGCTG TCAGCGATAT CCTCAAGCTG GCGGATTGGA TGCAGACTGC GCAGAGGCTG
GCCACCTCCA CCGGCAATGG GATGGAACGG GGACGCTCCG GAGGTTCGGG GCTGCGCGCG
GCGATGGCGC TCAATGCGCT CCTGGGACAG CACGGGCGAC TGGGAGCCGG GGTGGTGGCC
AAATCCGGAC TGGCCTCCCC CAAGACACGC GCCCGGCTGC AGGGCGATCA TCTGATCGAC
CCTGAAACAC GGGTGGTGAA TATCCTTGAT GTGGCACAGC TGATGCTTGA TCGCGAGCAA
TCCACACCGA TCAATGCCGT CATGATCTAC AACCACAATC CTGTCGCGAC ACACCCCGAC
CAGACTCGGA TGATTGCGGC GTTGAGTCAG CCTGAGATCT TTACCGTTGG CTGCGACGTG
GTGATGACCG ACAGCATGGC ATATTGCGAT GTGATCCTGC CCGCAGCCAC CCACTTTGAG
CACGATGAGA TCTTTGCCGC CTATGGTCAG AACTATGTCC AGCGGGCCGA GCCCGTGATC
GCGCCCATCG CGGAAAGTCT GCCGAACACG GAGATTTTCC GCCGTCTGGC CGCGCGTTTT
GGCTATATCG AGCCAGAGCA TCAGGCTTCG GACGTGCAGT TGATGGATGA CGCGATGGAT
GCCAGCCATC CGCAATTTGA AGGCCGCCCC CCAAGCCAGA TGCCGGTGGA TCGCGCGTTG
GAAATGCAAA AAAGCGATGG CGCTGCTCTG GTCCTCTGCG AGACGGTGAC GCCCGACACA
GAAACAGGCC GGATTGAGCT GTTCAGCGCC GCGCTCGAGG CGGAGACCGG CTTTGGCGTG
CCGCGCTATG ATCCGGTGTC GCAGGACCTG CCTTTGGTGT TGATTACCCC GAGTTCGGAC
AAACGCATCA ACGCCACTTT CGGCAGCGCG CCGCTTTCCG CAGGGCCCGA GAGCATCGAG
GTGCACCCCG AAGACGCCGC CGCACGGGGG CTGCAGGATG GTGATTTGGT CGAGGTTGCC
AACGCCCGCG GAAGCGTCAC GTTCCGTCTC AAGATCTCGG ATGCGATGCA GCGCGGTGTG
ACGTATGCCC CCAAGGGTAC ATGGCGCATC AGTTCCAAGA CGGGCCTCAC CGCAAACGCT
CTGATCCCGG TGGAGGTACG CACCGATATT GGCGACGGGG CCTGCTATAA CGAGACCTTC
GTTGAGCTGC GCGCCTATGC CGGGGTGCGC GCCGCCGAGT GA
 
Protein sequence
MSRQARSSQD TRVVPSVCPL DCPDTCSLSV EVTGDQITAV RGSNANPFTA GVVCNKVTRA 
YPDFVHGPAR LTHPLRRVGP RGAGTFERIS WEDALDLVVE GFHAAIDAHG AQSILPLNYA
GPHGELAGGS LDRRFFYRLG ATRMDRGPLC GGVRSLAYES LYGNAPGMPP EQAQHADLIV
VWGNNVTVSN LHLTRIVKAA RAKGARLVVI DPKRIKIAEQ ADLYLQVRPG TDVVLAHALA
AELERRGAFD AEFIAHWVHG VDEYMDEARK HSVENASEIC GLAVSDILKL ADWMQTAQRL
ATSTGNGMER GRSGGSGLRA AMALNALLGQ HGRLGAGVVA KSGLASPKTR ARLQGDHLID
PETRVVNILD VAQLMLDREQ STPINAVMIY NHNPVATHPD QTRMIAALSQ PEIFTVGCDV
VMTDSMAYCD VILPAATHFE HDEIFAAYGQ NYVQRAEPVI APIAESLPNT EIFRRLAARF
GYIEPEHQAS DVQLMDDAMD ASHPQFEGRP PSQMPVDRAL EMQKSDGAAL VLCETVTPDT
ETGRIELFSA ALEAETGFGV PRYDPVSQDL PLVLITPSSD KRINATFGSA PLSAGPESIE
VHPEDAAARG LQDGDLVEVA NARGSVTFRL KISDAMQRGV TYAPKGTWRI SSKTGLTANA
LIPVEVRTDI GDGACYNETF VELRAYAGVR AAE