Gene Dred_1794 details

Gene Information

Locus tag: Dred_1794
Symbol:
ID: 4956908
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfotomaculum reducens MI-1
Kingdom: Bacteria
Replicon accession: NC_009253
Strand:
Start bp: 1965581
End bp: 1966969
Gene Length: 1389 bp
Protein Length: 462 aa
Translation table: 11
GC content: 43%
IMG OID: 640180968
Product: hydrogenase large subunit
Protein accession: YP_001113144
Protein GI: 134299648
COG category: [R] General function prediction only
COG ID: [COG4624] Iron only hydrogenase large subunit, C-terminal domain
TIGRFAM ID:
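
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes a single terminal stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 1965581
END_BP = 1966969
GENE_LENGTH_BP = 1389
PROTEIN_LENGTH_AA = 462


def span_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one for the stop codon.

    Assumes an unspliced CDS whose length includes one terminal stop codon.
    """
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)               # True
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)  # True
```

Under these assumptions the record is internally consistent: 1966969 - 1965581 + 1 = 1389 bp, and 1389 / 3 - 1 = 462 aa.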


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATCCGG TATCGGAAGT GACTAGACTC CGTAGAGATA TCTTTACCAG AGTAGCGCAC 
TGGGCTTTAA ACCACCCTCT AGATGTACCT ATGGAAATGG ATATATCTCA AGTTGTAAAA
GAAATCATAC CGGACGGACC GGCTCGTTAT CGTTGTTGCA TCCATAAAGA AAGAGCAATT
GTGGAAGAAC GGATTAAGAT TGCTGTGGGC TTGATTCAAC CTGAACATCG AGTGACGGTT
ATATCAGAAG CCTGTAATGG ATGTTCGCTA AATAAATATG TAGTAACAGA TGCTTGTCAA
AACTGTGTGG CACATCCATG CAGAAACAGT TGCCCTAAGA AAGCCATATC AGTAATTCAA
AATCGCGCCT TTATTGATCA AAATTCCTGT GTGGAATGTG GGAAATGTGC TAATGCCTGT
CCTTATAACG CTATCATTGA GGTTACCCGT CCCTGTGAAA GGGCCTGTGC CCTAAAGGCC
ATAAAAGTTG ATGATTCCCG CAAGGCAGTG ATTGACCACG AGTTGTGTGC ATCCTGTGGT
TTATGCGTCA CCGTGTGCCC CTTTGGTGCC ATCACAGATC ATTCCCAAAT GATCGATGTT
ATATGTTCTC TTAGGGTAAG TGAGCAACCG CATGTGGCCG TGATAGCGCC TGCCATAGCA
GGTCAATTTG GTCTAAAGGT TAGTCCCGAC CAAATCAAAG CTGCACTTTT GAAATTAGGT
TTTGATGAAG TCATTGAAGC TGCTTTAGGG GCAGATATGG TTGCCCAAGA AGAAGCTGGA
GAGATTAAAC TTCATGCAGA AGATAAAAAA ATTATGTTAA ACTCATGCTG CCCGGCTGTT
GTAAAGGCTG TGTCACTAAA ATTACCAGAA TTGAAGGATT GTATCTCCAC TACCCTATCT
CCCATGCGTG TAACTGGCAA ACTGATTAAA GAAAGATACG ATAATAGAGT AATAACTGTC
TTTATAGGTC CTTGTATAGC AAAAAAAGAG GAAGCAACCC ATGGTGATGA GATTGATATG
GTGTTAACCT TTGAAGAATT AGCGGCAATA TTCTCGGCAT CTGAATTAGA AATCTCTTCC
CTCGAGCCTG TGGAATTAAA GGATGCGTCT CCTAATGGCA GATTATTTGC CAGAGCCGGT
GGTGTAAGTA CGGCGGTTCG CAAGCATCTG GGAGAAATGG AACTGAATGT TCTGCGGGTG
CAAGGTTTAG GACAATGTCT AACAACATTA AAAACCTCTG CAAAAAAATC AGACTCTAAT
TTTACTTTTA TTGAATGTAT GGCCTGCGAA GGCGGTTGTG TGGGTGGACC GGGGACGATG
GTTGCATCCA CAGTGGGAAC AAGGGCTGTT GAAAAACATG CTGGAGAAAG TATCATATTA
CCCAAATAA
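
A GC content figure such as the one reported in the Gene Information section is presumably the percentage of G and C bases in the sequence. The sketch below shows one way to compute it from the space-separated blocks used on this page. The function name `gc_content` is hypothetical, and the example runs on the first line of the gene sequence only, so its result is not expected to match the whole-gene value.

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C bases; whitespace and case are ignored."""
    cleaned = "".join(sequence.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in cleaned if base in "GC")
    return 100.0 * gc_count / len(cleaned)


# First line of the gene sequence above, used only as sample input.
first_line = "ATGAATCCGG TATCGGAAGT GACTAGACTC CGTAGAGATA TCTTTACCAG AGTAGCGCAC"
print(f"{gc_content(first_line):.1f}% GC")
```

Passing the full 1389 bp sequence to the same function would give the value to compare against the reported GC content.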
 
Protein sequence
MNPVSEVTRL RRDIFTRVAH WALNHPLDVP MEMDISQVVK EIIPDGPARY RCCIHKERAI 
VEERIKIAVG LIQPEHRVTV ISEACNGCSL NKYVVTDACQ NCVAHPCRNS CPKKAISVIQ
NRAFIDQNSC VECGKCANAC PYNAIIEVTR PCERACALKA IKVDDSRKAV IDHELCASCG
LCVTVCPFGA ITDHSQMIDV ICSLRVSEQP HVAVIAPAIA GQFGLKVSPD QIKAALLKLG
FDEVIEAALG ADMVAQEEAG EIKLHAEDKK IMLNSCCPAV VKAVSLKLPE LKDCISTTLS
PMRVTGKLIK ERYDNRVITV FIGPCIAKKE EATHGDEIDM VLTFEELAAI FSASELEISS
LEPVELKDAS PNGRLFARAG GVSTAVRKHL GEMELNVLRV QGLGQCLTTL KTSAKKSDSN
FTFIECMACE GGCVGGPGTM VASTVGTRAV EKHAGESIIL PK
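
The protein sequence can be reproduced from the gene sequence by translating codon by codon. The sketch below is a pure-Python illustration, not the tool that generated this page. It relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard code and differs only in its permitted start codons. The names `CODON_TABLE` and `translate` are illustrative, and translation simply stops at the first stop codon.

```python
from itertools import product

# Codon-to-amino-acid assignments in TCAG order. Table 11 shares these
# assignments with the standard code; only the start codons differ.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace and case are ignored; codons containing ambiguous bases
    are rendered as 'X'.
    """
    cleaned = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(cleaned) - 2, 3):
        amino_acid = CODON_TABLE.get(cleaned[i:i + 3], "X")
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence above.
first_line = "ATGAATCCGG TATCGGAAGT GACTAGACTC CGTAGAGATA TCTTTACCAG AGTAGCGCAC"
print(translate(first_line))  # MNPVSEVTRLRRDIFTRVAH
```

The output matches the first 20 residues of the protein sequence, and the final codons of the gene (CCC AAA TAA) translate to the terminal PK followed by a stop.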