Gene Dole_1950 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDole_1950 
Symbol 
ID5694790 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfococcus oleovorans Hxd3 
KingdomBacteria 
Replicon accessionNC_009943 
Strand
Start bp2360036 
End bp2361136 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content61% 
IMG OID641264548 
Productprephenate dehydratase 
Protein accessionYP_001529831 
Protein GI158521961 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0077] Prephenate dehydratase
[COG1605] Chorismate mutase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTGCTG AAAGCAGCGG CCAGGATGAA AAGATAGCGA ACCTGCGCCG GTCCATTGAT 
GAAATTGATG ACACCATCCT GGACCTGCTC AACCGGCGGG TCTCTCTGGC CGAAGCGATC
GGGACGCTGA AGACGCAGAC CGGCAACCGG GTCATGGACA AGGCCAGGGA AGAATCGATC
CTGCAGCGGC TGGCCGGGCT CAACCCCGGC CCCTTGTCCT CTGAGATGCT GCGGCGGATA
TTTGTCGACA TCATTGCGGC CTCGCGTCAG GCCCAGGAAC CCAAGCGGAT CTCCTTTCTG
GGGCCGGAGG CCACCTTCAC CCATGTCGCG GCCCTGGCTT TTTTTAATGA GCTGGATACC
TTTGTCCCCC ACCCGAGTAT TCGGGACGTG TTTGATGACG TGGAAAAGGG GACCAGCCGG
TACGGCGTGG TGCCGGTGGA AAATTCCATT GAGGGCGCGG TCAACCACAC CCTTGATCTT
TTCCTGGAAT CCGAGCTTCA CATCTGCGCC GAGTCCTACC TGGCCATTTC CCATGACCTG
CTTTCAAAAA GCGGTGACCT GGAAAAGATT CATACCATCT ATTCCCACCC CCAGCCCTTT
GCCCAGTGCC GGACGTGGCT CAAGACCCAT CTGCCCCATG CCGAACTGGT GGAGTGCGGC
AGCACCTCCC AGGCGGCCCA GAAAGCCCTA CTGGCCGACG ATGCCGCGGC CATTGCCGGC
AGCGCCGCGG CCCGGCTGTA TGACCTGAAG GTGGCGGCGC CGGCCATTCA GGATGCCGTG
CGCAACACCA CCCGGTTTCT GGTCATCGGC CGGGACGCGC CCCGGCCCAC AGGCAACGAC
AAGACATCCA TCCTGTTTGT GACGGCCCAT ATTCCCGGGG CGCTGTTCAA GGCACTGGAG
CCCATTGCCG CGTCCGGCCT CAACATGCTT AAACTGGAGT CCCGGCCGGC CCGGCACAAG
AACTGGAGCT ACGTGTTTTT CGTGGACCTG GAGGGCCATG TCGAAAACGA GAAGGTGAAA
CAGTGCCTGG CAAAAATGGA GGCCTTCTGC CAGTTCATCA AAATCCTGGG CGCTTACCCG
GTAGCCCTGT CGGACGCATG A
 
Protein sequence
MSAESSGQDE KIANLRRSID EIDDTILDLL NRRVSLAEAI GTLKTQTGNR VMDKAREESI 
LQRLAGLNPG PLSSEMLRRI FVDIIAASRQ AQEPKRISFL GPEATFTHVA ALAFFNELDT
FVPHPSIRDV FDDVEKGTSR YGVVPVENSI EGAVNHTLDL FLESELHICA ESYLAISHDL
LSKSGDLEKI HTIYSHPQPF AQCRTWLKTH LPHAELVECG STSQAAQKAL LADDAAAIAG
SAAARLYDLK VAAPAIQDAV RNTTRFLVIG RDAPRPTGND KTSILFVTAH IPGALFKALE
PIAASGLNML KLESRPARHK NWSYVFFVDL EGHVENEKVK QCLAKMEAFC QFIKILGAYP
VALSDA