Gene Dole_0250 details


Gene Information

Locus tag: Dole_0250
Symbol:
ID: 5693068
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfococcus oleovorans Hxd3
Kingdom: Bacteria
Replicon accession: NC_009943
Strand:
Start bp: 284628
End bp: 285779
Gene Length: 1152 bp
Protein Length: 383 aa
Translation table: 11
GC content: 54%
IMG OID: 641262830
Product: putative transcriptional regulator
Protein accession: YP_001528137
Protein GI: 158520267
COG category: [K] Transcription
COG ID: [COG2865] Predicted transcriptional regulator containing an HTH domain and an uncharacterized domain shared with the mammalian protein Schlafen
TIGRFAM ID:
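
The coordinate and length fields above are related by simple arithmetic, which can be checked with a short sketch. It assumes that Start bp and End bp are 1-based inclusive positions and that the last codon of the CDS is an untranslated stop codon. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information fields above.
start_bp = 284628
end_bp = 285779
gene_length_bp = 1152
protein_length_aa = 383


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Amino acids encoded by an unspliced CDS.

    Assumes the final codon is a stop codon and is not translated.
    """
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_length = span_length(start_bp, end_bp)
computed_protein = expected_protein_length(computed_length)

# Compare the computed values with the values reported in the record.
print(computed_length == gene_length_bp)
print(computed_protein == protein_length_aa)
```

Under these assumptions both comparisons hold for this record: 285779 - 284628 + 1 = 1152 bp, and 1152 / 3 - 1 = 383 aa.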


Plasmid Coverage information

Num covering plasmid clones: 35
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAGAAT CCCAGACCAT TGTTAATCCT TATCCCAGCC CGATCAGCTA CAAAGGGGAA 
TATCATTATC GCAGCGGCAG CACTAAACAG CAGCTCAAAG GGGCGGCGCT GGAAACCTTT
CTGCTGCGCA AGCAGGGCCG GCATTGGGAC AGCGTGCCGA CGCCCTATGC CAAAGAAGAG
GACCTGGATG CCAATGCCTT CAAACAATTC CGACGGGCTG CGGCCAAAAG CGGTCGCATG
GATGCGAGCG TCCTGGAAGA CAGCCCGCGG CGCATTCTGG AAAACCTGAA TCTGATTGAA
GGCGGCCATT TGCGCCGGGC GGCGGTTCTG CTGTTTCACG AAACCCCGGA ACGTTTTATC
ACCGGCGCCT ATGTGAAAAT CGGCTTTTTC CGCACCGACG CCGATCTGAT TTATCAGGAT
GAGGTTCAGG GCAACCTCTT TGATCAGGCC CGCAAAACCC TCGATCTGCT CCTGACCAAA
TACATGAAAG CTTATATCCG CTACGAGGGC ATCACCCGGG TGGAACGATT TCTTTTTCCT
CCCGAAGCCC TGCGTGAGCT CATCCTCAAC GCGCTGGTGC ATCGGGATTA CGGCAGCGGC
GCGCCCATTC AGATTCGCGT GTACGAGGAT CAATTGTGGA TTGCCAACGA TGCCATCATT
CCGCCGGACT TTACCGTCGA GCATCTGCTT TCCCGCCATG TCTCCAAACC CCACAATCCG
CTCATTGCCG GCGCCTTTTT TCGCACCGGC GATATCGAGT CCTGGGGACG CGGCATCGAA
AAAGTCCGCA CTGCCTGCGA GGAAAACGGC ACGGATTTCC CCTCCTTTCG GTTTGAGCCG
ACTGGTCTCA TGGTCATGTT CAAAGGCCGG ATTCCTGTGG AAGAAGCAAC CCCAACTGAA
ACAGAAGCGT CGGGAAAACG TCGGGAAAAC GTCGGGAAAG TGTCGGGAAA GATTCTTGAT
GCCTGTCGGG AAAATCCATC CATAACCATA CCTGAAATGG CAGAACTGAT CGGCATTACC
GAACGCTCTA TCCAGAGGAA TATTCAGAAA TTGAAGACCG ATGGTTTTCT TTGTCGCGTG
GGTGGCAGAA AAGAAGGCCA CTGGGAGGTG ACGGGCGAGA ATGGAGAATT GAAAATGGAG
AATGGAGAAT GA
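
The gene sequence above is printed in space-separated blocks of 10 bases. A minimal sketch of how such a block-formatted sequence might be cleaned and its GC content computed follows. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first row of the sequence is used as demo input, so the printed percentage describes that row and not the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases among all bases in seq."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First row of the gene sequence above, used only as a short demo input.
first_row = "ATGAAAGAAT CCCAGACCAT TGTTAATCCT TATCCCAGCC CGATCAGCTA CAAAGGGGAA"
seq = clean_sequence(first_row)

# Print the number of bases and the GC content as a rounded percentage.
print(len(seq), round(100 * gc_fraction(seq)))
```

Applying the same two functions to all rows of the gene sequence would give a value to compare against the GC content field of the record.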
 
Protein sequence
MKESQTIVNP YPSPISYKGE YHYRSGSTKQ QLKGAALETF LLRKQGRHWD SVPTPYAKEE 
DLDANAFKQF RRAAAKSGRM DASVLEDSPR RILENLNLIE GGHLRRAAVL LFHETPERFI
TGAYVKIGFF RTDADLIYQD EVQGNLFDQA RKTLDLLLTK YMKAYIRYEG ITRVERFLFP
PEALRELILN ALVHRDYGSG APIQIRVYED QLWIANDAII PPDFTVEHLL SRHVSKPHNP
LIAGAFFRTG DIESWGRGIE KVRTACEENG TDFPSFRFEP TGLMVMFKGR IPVEEATPTE
TEASGKRREN VGKVSGKILD ACRENPSITI PEMAELIGIT ERSIQRNIQK LKTDGFLCRV
GGRKEGHWEV TGENGELKME NGE