Gene Dred_2042 details

Gene Information

Locus tag: Dred_2042
Symbol:
ID: 4958038
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfotomaculum reducens MI-1
Kingdom: Bacteria
Replicon accession: NC_009253
Strand:
Start bp: 2241872
End bp: 2242921
Gene Length: 1050 bp
Protein Length: 349 aa
Translation table: 11
GC content: 42%
IMG OID: 640181211
Product: respiratory-chain NADH dehydrogenase, subunit 1
Protein accession: YP_001113384
Protein GI: 134299888
COG category: [C] Energy production and conversion
COG ID: [COG1005] NADH:ubiquinone oxidoreductase subunit 1 (chain H)
TIGRFAM ID:
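
The coordinates and lengths listed above are mutually consistent, which a little arithmetic can confirm. The following is a minimal sketch in Python. It assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not translated; neither is stated on this page. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 2241872
end_bp = 2242921

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1050
print(protein_length)  # 349
```

Under these assumptions the results agree with the listed gene length (1050 bp) and protein length (349 aa).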


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000268939
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGGAGAACT TATTTGTTAA TTTAGCCAGC GGGTCTCGGA CCCTGCTGGG GTCCGCTGGC 
CTACCCGGTG CGGCAACTGA TTTCATAGTA ATGTTCTTAA AATTAGGTGC TATTCTTGTA
TACATCTTAG TCAGTGCTCT CTGGCTGGTG TACATGGAAA GGAAAGTATC GGCCTATATG
CAGTGTCGGA TAGGTCCTAA CCGGGTTGGA CCCTTGGGTT TGTTACAGAC CACAGCGGAT
ATCGGGAAAT TAATAAGCAA AGAAATTATT ATTCCTAGAT GTGTAGATAA AAAGTTGTTT
TTGCTGGGAC CTATGTTGAT TTTTATGCCA CCCTTGGCAG TCTTTGCTGT TGCTCCCTTT
GGCAAAGATA TGGTGGCCAT CGATTTGAAC ATAGGAGTTT ACTACTTCTT GGCTGTAGCT
TCTTTATCAA CTGTAATTGT CTGGATGTCT GGTTGGGCCT CTAACAACAA GTACTCCTTA
ATTGGAGGTA TGCGCGTAGT GGCTCAAATG GTAAGCTACG AAATGCCTTT AATTTTATCC
ATTGTCGGGG TCATCATTTT AACCGGAACC TTAAACATGA GCGAAATTAT CCAGGCACAG
GAAGGAGTTT GGTTTATCTT TCTGCAACCC CTTGGTTTTT TAATTTACTT AATCGCAGGA
GTTGCCGAAA CAAACCGGGC CCCCTTTGAC TTAGTAGAAG GAGAATCGGA AATTATCTGC
GGACCCTTTA CTGAATATAG TGGCATGGGT TTTGCCATGT TCTTTCTGGC TGAGTATGCC
AATGTTGTGC TTGTTTCCGT AATGGCAACC ACTTTGTTTT TAGGAGGTTG GCAAGCACCC
TTTGGGCTTA CTTTTATTCC ATCCTGGATT TGGTTTTTGT TTAAAGTATA TGTGATGATT
TTTCTCTTCA TGTGGTTCCG TTGGACCTAT CCAAGGGTTA GGGTGGATCA GTTAATGGAA
TTTGGTTGGA AGGTACTGGT TCCTCTTTCT ATTGCGAATA TTTTCTTAAC TGGTATTGGT
AAATATCTGT ATCAAACACT AGGGTGGTGA
 
Protein sequence
MENLFVNLAS GSRTLLGSAG LPGAATDFIV MFLKLGAILV YILVSALWLV YMERKVSAYM 
QCRIGPNRVG PLGLLQTTAD IGKLISKEII IPRCVDKKLF LLGPMLIFMP PLAVFAVAPF
GKDMVAIDLN IGVYYFLAVA SLSTVIVWMS GWASNNKYSL IGGMRVVAQM VSYEMPLILS
IVGVIILTGT LNMSEIIQAQ EGVWFIFLQP LGFLIYLIAG VAETNRAPFD LVEGESEIIC
GPFTEYSGMG FAMFFLAEYA NVVLVSVMAT TLFLGGWQAP FGLTFIPSWI WFLFKVYVMI
FLFMWFRWTY PRVRVDQLME FGWKVLVPLS IANIFLTGIG KYLYQTLGW
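
The relationship between the two sequences above can be illustrated by translating the ends of the gene sequence. The following is a minimal sketch in Python, not the tool used to produce this page. It assumes the amino-acid assignments of translation table 11 (which match the standard code) and that the initiator codon, here GTG, is read as methionine. The function name translate_cds and its is_cds_start parameter are hypothetical helpers introduced for this example.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
# Translation table 11 uses the same amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(dna, is_cds_start=True):
    """Translate an in-frame DNA fragment, stopping at the first stop codon.

    Whitespace is ignored, so sequences can be pasted in blocks of ten.
    If is_cds_start is True, the first codon is treated as the initiator
    and reported as M (assumption: GTG initiates with methionine).
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    if is_cds_start and protein:
        protein[0] = "M"
    return "".join(protein)


# First 60 and last 30 nucleotides of the gene sequence listed above.
first_60 = "GTGGAGAACT TATTTGTTAA TTTAGCCAGC GGGTCTCGGA CCCTGCTGGG GTCCGCTGGC"
last_30 = "AAATATCTGT ATCAAACACT AGGGTGGTGA"

print(translate_cds(first_60))                     # MENLFVNLASGSRTLLGSAG
print(translate_cds(last_30, is_cds_start=False))  # KYLYQTLGW
```

The two outputs match the first 20 and last 9 residues of the protein sequence, and the final TGA codon is the stop codon that is not represented in the 349 aa product.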