Gene Dole_2150 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDole_2150 
Symbol 
ID5694994 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfococcus oleovorans Hxd3 
KingdomBacteria 
Replicon accessionNC_009943 
Strand
Start bp2608856 
End bp2609977 
Gene Length1122 bp 
Protein Length373 aa 
Translation table11 
GC content63% 
IMG OID641264752 
Producthypothetical protein 
Protein accessionYP_001530031 
Protein GI158522161 
COG category[S] Function unknown 
COG ID[COG0327] Uncharacterized conserved protein 
TIGRFAM ID[TIGR00486] dinuclear metal center protein, YbgI/SA1388 family 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGCGTG CCACTGTCGG CGACATGGTC GCCATCATGA ACCGTCTGGC GCCGCCGGAC 
CTGGCGGAAA CATGGGACAA CTGCGGGCTT CAGGCGGGCT GTTTCGACCA GGAAGTGAAA
ACCGTGCTGG TGGCCCTGGA CCCCTCTTCG GCCGTGGTCC GGGATGCCTG TGACCGGGAC
GTCGATCTGC TGATCACCCA CCACCCCCTG TTTCTGTCGC CACCGCGGTC AGTTGACTTT
TCCCGCATGC CGGGAACGGC CATCTTTCTG GCCGCCACCC ACGGTCTTTC CATCTTCAGT
GCCCATACCA ATCTGGACAG CGCCGAGGGC GGCCTCAACG ACCGGTGCGC CGAATGTATT
GGTCTTCAGA ATGTGAGGGT GCTTTCCCGA GCAGGGCAGA CGGATCACGT CAAACTGGCC
TTTTTCGTTC CGGTGGAACA CGAGGCCCGG CTGCTGGAGG CTCTGGCCGC CACGCCGGCG
GGCGCATGGG GCCGGTACAG CAGCTGTTCT TTTTCCGTTC GTGGAACGGG CCGGTTCCAA
CCCCTGGAAG GAGCCGTTCC TTTTATCGGC CGTACCGGAG AGATCGTTGC CGTGGAAGAG
GTCCGGGTGG AAGCCATCGT GCCCCGTCGT GACCTGGACG GCGTGGTTCG AGTCCTCAAG
CAGGCCCATC CTTACGAGAC CATGGCCTAT GACGTTTTTC CCCTGGCTGG GGGCATCGAA
CCCCTGCACG GGCTGGGTCG TATCGGTGAG GTAGAACCCT CCACCCTGGA GGTGTTTGCC
GGCCGTGTCA AGCAGCGATT CGGTGTGGAC CGCGTCGGTG TGGCAGGCGA CATGGCCATG
CCGGTAAAAA CCGTGGCCGT CTGTTCCGGG GCCGGCTCCA GCCTGATCAG GGATTTTCTG
ACATCCGGGG CCGACGTGTT TGTCAGCGGC GACCTGAAGT ATCACGACGC CATGGCGGTG
GTTGAGGCCG GCCGGGCACT GATTGATGTG GGCCATTTCG AGACGGAGCA CCTGGTGGTG
GAACTGCTGG TGGATCGATT GACCCGGGAG ACAGCAGCGG CAGGGTATGA AGTGCGGGTG
GCCGGATACG CCGGCCAGCG CAATCCCTGC CGTTTTATAT GA
 
Protein sequence
MMRATVGDMV AIMNRLAPPD LAETWDNCGL QAGCFDQEVK TVLVALDPSS AVVRDACDRD 
VDLLITHHPL FLSPPRSVDF SRMPGTAIFL AATHGLSIFS AHTNLDSAEG GLNDRCAECI
GLQNVRVLSR AGQTDHVKLA FFVPVEHEAR LLEALAATPA GAWGRYSSCS FSVRGTGRFQ
PLEGAVPFIG RTGEIVAVEE VRVEAIVPRR DLDGVVRVLK QAHPYETMAY DVFPLAGGIE
PLHGLGRIGE VEPSTLEVFA GRVKQRFGVD RVGVAGDMAM PVKTVAVCSG AGSSLIRDFL
TSGADVFVSG DLKYHDAMAV VEAGRALIDV GHFETEHLVV ELLVDRLTRE TAAAGYEVRV
AGYAGQRNPC RFI