Gene Dgeo_1949 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDgeo_1949 
Symbol 
ID4057483 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDeinococcus geothermalis DSM 11300 
KingdomBacteria 
Replicon accessionNC_008025 
Strand
Start bp2052083 
End bp2053222 
Gene Length1140 bp 
Protein Length379 aa 
Translation table11 
GC content66% 
IMG OID641230981 
ProductNADH dehydrogenase 
Protein accessionYP_605412 
Protein GI94986048 
COG category[C] Energy production and conversion 
COG ID[COG1252] NADH dehydrogenase, FAD-containing subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGACCC TCATTCTCGG TGCTGGCTAC GCGGGCCTCG CGACGGCCAC CAAGCTGAAA 
CCCACGCCTG GCCTGGAAGC CCTGCTGATC GACCAGAACC CCTACCACAC CTTCGAAACC
CGCCTGCACG AGGCGGCGGC GCACAACACG CCCGTCACGC TGCCCATCCT GCCGCTGCTG
CGCGGCACGG GGGTGCATTT CGAGCAGGCG TCGGTCGAGA ATGTGAACGT CGACGAGCGC
GAGGTCAAGC TCAAAGATGG CCGCGTGCTG ACCTACGACA AGCTGGTCGT GGCGCTAGGC
TCGGTGACCA ATTTCTACCG GATTCCCGGT CTGGCCGAGC ATGCCGCCGA ACTCAAGCAG
CTCAGCGACG CCGACGAGAT CTTTAACTTC ATCAACCGCG TCTACAGCAG TGACTACCAG
GGCAACCGCG ACATCGTGGT GGGCGGCGCG GGGCTGACCG GCGTCGAGCT GGTGACCGAA
CTCGCGCAGC GGGCCGAGGT GCTCAGCCGT GAGCGCGGCC TGCCCCCAGT CCAGATCTAC
CTGGTGGAAG CCGGGCCCAA GATTCTCCCG GTCCTCGACG ACGCCCTGCG CGCCAAGGCC
GAGAAGACCC TGCGCGACTA CGGTATCCAC ATCCTGGTGG GTCACCGCAT CACGAGTGCG
GCTGCGGACA GCGTGACGGT ACAGACGCAG GACGGTCAGG AGCAGGTGAT TCCCGCCGGC
AAGATCATCT GGACCGGCGG CATCCAGGCC CGCAACATCG TGCAGGGCGA ACACCTCGAA
AAGGGCCCCG GTGGGCGCAT CGCCGTGGAC GAGTACCTGC GCGCCAAGAA CTATCCCGAC
GTGTTCGTGA TCGGGGACAT GGGCCTGGCC CTCAACCAGG AAGGCAAGCC GGTGCCCACC
ACCGCCCAGC ATGCCGGACA GCAGGGCCGC CTGACCGGCA AGAATCTGAT GCGCCTTGCC
AAGGGCGAGC CGCTCGAACC CTATGAGCCG ACCACACTGG GCGAATTTGT CTCGCTGGGC
GGCCTGATGG CGGTGGGCTG GATGAAGCTT CCCTGGAACC AGAAGCTCGC CATTACCGGC
GGCCTCGCCC ACGTGATGAA GCGGGCGTCG GAGTGGCGCT GGCGGGCCAG CATCGACTGA
 
Protein sequence
MKTLILGAGY AGLATATKLK PTPGLEALLI DQNPYHTFET RLHEAAAHNT PVTLPILPLL 
RGTGVHFEQA SVENVNVDER EVKLKDGRVL TYDKLVVALG SVTNFYRIPG LAEHAAELKQ
LSDADEIFNF INRVYSSDYQ GNRDIVVGGA GLTGVELVTE LAQRAEVLSR ERGLPPVQIY
LVEAGPKILP VLDDALRAKA EKTLRDYGIH ILVGHRITSA AADSVTVQTQ DGQEQVIPAG
KIIWTGGIQA RNIVQGEHLE KGPGGRIAVD EYLRAKNYPD VFVIGDMGLA LNQEGKPVPT
TAQHAGQQGR LTGKNLMRLA KGEPLEPYEP TTLGEFVSLG GLMAVGWMKL PWNQKLAITG
GLAHVMKRAS EWRWRASID