Gene Dole_2034 details


Gene Information

Locus tag: Dole_2034
Symbol:
ID: 5694877
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfococcus oleovorans Hxd3
Kingdom: Bacteria
Replicon accession: NC_009943
Strand:
Start bp: 2463999
End bp: 2465051
Gene Length: 1053 bp
Protein Length: 350 aa
Translation table: 11
GC content: 61%
IMG OID: 641264635
Product: N-acetyl-gamma-glutamyl-phosphate reductase
Protein accession: YP_001529915
Protein GI: 158522045
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0002] Acetylglutamate semialdehyde dehydrogenase
TIGRFAM ID: [TIGR01850] N-acetyl-gamma-glutamyl-phosphate reductase, common form
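
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the record. It assumes that the start and end positions are inclusive and that the gene length includes the terminal stop codon. The variable names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2463999
end_bp = 2465051
reported_gene_length = 1053     # bp
reported_protein_length = 350   # aa

# Assumption: both coordinates are inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with a stop codon, which encodes no residue,
# so the protein has one residue fewer than the number of codons.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1053 350
print(gene_length == reported_gene_length,
      protein_length == reported_protein_length)  # True True
```

Under these assumptions the computed values agree with the reported Gene Length and Protein Length.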


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000000233071
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTAAAAA GTGGAAGCAT TCGGGCCGCG GTGGTAGGGG CCACGGGATA TGCCGGGGCC 
GAGCTGGTGC GACTGCTGGC CGGGCATTCG GATGTCACGA TCACCGCCAT TACGTCCCGC
CAGTATGCCG GCGTCCCTTT TAACCAGGTC TATCCGGCGG TGGGAACAGC CGTTTCTCTG
GTGTGCGAGA CGTTTGCGCC GGAACCCATC TGTGAGCGGG CCGATATCGT TTTTACCGCG
CTTCCCCACA AACTGCCCAT GAGCATTGTG CCGGAACTGC TGGATCGGGG CGTGCGGGTG
GTGGACCTGT CCGCCGACTT CCGGTTTTCC GATGTGGCGG CCTATGAACG CCATTACCAG
GCCCACACCG CGAAGGAACT CTGCAAAAAG AGCGTTTACG GGCTCTGCGA GGTCTATGGG
GAAAAGATAA AAAAGGCCGA TCTGGTGGGC AATCCGGGCT GTTATCCCAC CAGCGTTCTG
CTGCCGCTGA TTCCGCTGGC CAGGGCCGGG CTGGTCGATA CGAAGATGAT CATCGTGGAT
GCCAAGTCCG GTGTCAGCGG CGCGGGCCGG TCCCCGTCAT TGGGGGTCCA CTTCTGCGAG
GTGAACGAAT CCTTCAAGGC CTATAAAGTG GCGGCTCACC GCCACGCACC GGAGATGGAG
GAGATTCTGG GCGAAGCGGC CGGGACACCG GTCTGCCTGA CCTTTGTGCC CCACCTGGTG
CCCATGACGC GCGGTATGCT GTCCACCATT TACGTGAACC CGGAACAGGC GGTGTCCGAG
CAGGATGTTC GTCAGTGCCT GGCCGATTAT TACAAGGGAC GGCCTTTTGT CCGCCTGTGC
GGGGAGGGGG CCTTTCCGGA AACCCGTTTC GTGCGGGGCA CCAATTTCTG CGACATCGGC
GTTCGCCTGG ATACCCATGC CAACCGCCTG ATCCTGGTCT CCGCCATCGA CAACCTGGTC
AAGGGGGCCG CCGGCCAGGC GGTTCAGAAC ATGAACCTCA TGTTTGGTGT TGACGAGGGC
CGGGGGCTTG ATATGATACC GTTTCCGGTG TGA
 
Protein sequence
MLKSGSIRAA VVGATGYAGA ELVRLLAGHS DVTITAITSR QYAGVPFNQV YPAVGTAVSL 
VCETFAPEPI CERADIVFTA LPHKLPMSIV PELLDRGVRV VDLSADFRFS DVAAYERHYQ
AHTAKELCKK SVYGLCEVYG EKIKKADLVG NPGCYPTSVL LPLIPLARAG LVDTKMIIVD
AKSGVSGAGR SPSLGVHFCE VNESFKAYKV AAHRHAPEME EILGEAAGTP VCLTFVPHLV
PMTRGMLSTI YVNPEQAVSE QDVRQCLADY YKGRPFVRLC GEGAFPETRF VRGTNFCDIG
VRLDTHANRL ILVSAIDNLV KGAAGQAVQN MNLMFGVDEG RGLDMIPFPV
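
The gene and protein sequences above are related through translation table 11, as listed in the gene information. The following is a minimal sketch of that translation, not the database's own tooling. It assumes the standard codon assignments that NCBI table 11 shares with the standard code. The helper names `join_blocks` and `translate` are hypothetical. Only the first and last few codons of the gene sequence are used here, so that the result can be checked by eye against the protein sequence.

```python
from itertools import product

# Codon table in NCBI order (first base varies slowest, bases ordered TCAG).
# Table 11 uses the same amino acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def join_blocks(text):
    """Remove the spaces and line breaks used to display 10-letter blocks."""
    return "".join(text.split()).upper()


def translate(dna):
    """Translate a coding sequence codon by codon, stopping at a stop codon."""
    dna = join_blocks(dna)
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)


# First 18 bases and last 15 bases of the gene sequence shown above.
first_codons = "ATGTTAAAAA GTGGAAGC"
last_codons = "CCG TTTCCGGTG TGA"

print(translate(first_codons))  # MLKSGS
print(translate(last_codons))   # PFPV
```

The two fragments translate to the beginning (MLKSGS) and the end (PFPV) of the protein sequence listed above. This sketch does not handle the alternative start codons that table 11 permits. The gene shown here begins with ATG, so that case does not arise.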