Gene Dole_2583 details


Gene Information

Locus tag: Dole_2583
Symbol:
ID: 5695434
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfococcus oleovorans Hxd3
Kingdom: Bacteria
Replicon accession: NC_009943
Strand:
Start bp: 3124273
End bp: 3125391
Gene Length: 1119 bp
Protein Length: 372 aa
Translation table: 11
GC content: 63%
IMG OID: 641265191
Product: prephenate dehydratase
Protein accession: YP_001530463
Protein GI: 158522593
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0077] Prephenate dehydratase; [COG1605] Chorismate mutase
TIGRFAM ID:
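
The coordinates and lengths above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive and that the CDS ends in a stop codon; both assumptions agree with the recorded values but are not stated in the record. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(3124273, 3125391)
print(length, protein_length_aa(length))  # 1119 372
```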


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACCGTT CCGCATCCAC GGATGTTGTA TCCATGCCTT CCGGCCTTGA CGACCTAAGG 
CGGGAGATCA CCCGCACCGA CAGGGAGCTT CTGGCCCTGC TCAACCATCG TGCCCGGCTC
TGCCGCCAGG TGGGCCGGGT CAAGTCGGCC GCCGACCAGG CGGTGTTCAA GCCCTTTCGG
GAAAAGGAGG TGCTTGAAGG GCTGGTGGCG GAAAACCCCG GAGACCTTCC CGACGACCAC
CTGCGCACCA TCTACCGGGA GATCCTTTCA TCCTCCCGGC GCTTGCAGCA ACCCCAGAAG
GCGGTCTACC TGGGGCCGGA AGGCACCTTT TCCTATTTTG CCGGCCGGGA GCTGCTGGGC
AGCAGCACCG ACTTTGAACC CTGCCCCAGC CTGGAGACGG TGTTTGCCGC GGTTGCCGGC
AAAAAGGCCG ACCTGGGCAT CGTGCCCCTG GAAAACTCCC TTTCCGGCAG CGCCGGCCAG
AACCTGGATC TTTTTCTCCG CTACGGTGTG CATATTCAGG CTGAAATCTA CCTGCGTATC
AGCTATCACC TGGTAGGGGC CGGCACCGGA CTTGCCGGTA TTCAGACCGT TTACTCACAC
CCCCGGGCCA TTGACCAGTG CGCGGCCTGG CTTTCGAGCC ATCTGCCCGA GGCCCATGTG
GTGTTTGTGG GCAGCACGGC CGCCGCGGCC CGTGAGGCGG CGGGCCGGCC TGACTGCGTC
GCCGTGGGCC ACCGCCAGCT GGCCGCCATG TTTTCCCTCA ACCTGCTGGC CGGGCCCGTC
GAAGACGCGC CGGACAACTG GACCCGATTC ATCGTCATCG GTCACCAGGC CCCTGCCGGC
GGCAGCCGGG ACAAGACATC AATTCTGTTC ACCCTGCCGG ACAAGTCCGG GGCCCTGGTC
AGTGTGCTTT CCGTTCTGGC CAGAGGGGGC ATCAACATGA AAAAACTTGA ATCCCGGCCC
ATGCGGTCTG AAAAATGGCA GTACCTGTTC TTCGCGGACC TGGAGTGCGA CCTCTCCGAT
GACGAGTATG CCGACCTTCA GGCCGAACTG GTTGAAAACT GCCAGACCCT GCGGGTGCTG
GGGAGCTATC CTGCGGGGCT GCATCTGAAC GACTGCTGA
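
The GC content listed in the gene information can be recomputed from a sequence in this ten-base-block layout. This is a minimal sketch with a hypothetical helper name, assuming GC content means the percentage of G and C bases over the full gene sequence; the database may compute or round it differently.

```python
def gc_percent(dna: str) -> float:
    """Percentage of G and C bases; spaces and line breaks are ignored."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Usage: paste the whole gene sequence block in place of this first row.
first_row = "ATGAACCGTT CCGCATCCAC GGATGTTGTA TCCATGCCTT CCGGCCTTGA CGACCTAAGG"
print(round(gc_percent(first_row)))
```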
 
Protein sequence
MNRSASTDVV SMPSGLDDLR REITRTDREL LALLNHRARL CRQVGRVKSA ADQAVFKPFR 
EKEVLEGLVA ENPGDLPDDH LRTIYREILS SSRRLQQPQK AVYLGPEGTF SYFAGRELLG
SSTDFEPCPS LETVFAAVAG KKADLGIVPL ENSLSGSAGQ NLDLFLRYGV HIQAEIYLRI
SYHLVGAGTG LAGIQTVYSH PRAIDQCAAW LSSHLPEAHV VFVGSTAAAA REAAGRPDCV
AVGHRQLAAM FSLNLLAGPV EDAPDNWTRF IVIGHQAPAG GSRDKTSILF TLPDKSGALV
SVLSVLARGG INMKKLESRP MRSEKWQYLF FADLECDLSD DEYADLQAEL VENCQTLRVL
GSYPAGLHLN DC
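
The protein sequence corresponds to the gene sequence read codon by codon. The sketch below shows this mapping using the codon assignments that translation table 11 shares with the standard genetic code. It is not the database's own procedure, and it does not handle the alternative start codons that table 11 allows, which is harmless here because the gene starts with ATG. All names are hypothetical.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (first, second, third base).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for pos in range(0, len(seq), 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon
            break
        protein.append(amino_acid)
    return "".join(protein)


print(translate("ATGAACCGTT CCGCATCCAC GGATGTTGTA"))  # MNRSASTDVV
print(translate("GACTGCTGA"))                         # DC
```

The two calls use the first 30 and last 9 bases of the gene sequence, and their output matches the first ten and last two residues of the protein sequence.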