Gene Gura_4302 details

Gene Information

Locus tag: Gura_4302
Symbol:
ID: 5166809
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter uraniireducens Rf4
Kingdom: Bacteria
Replicon accession: NC_009483
Strand:
Start bp: 4969645
End bp: 4970673
Gene Length: 1029 bp
Protein Length: 342 aa
Translation table: 11
GC content: 58%
IMG OID: 640551781
Product: 3-deoxy-7-phosphoheptulonate synthase
Protein accession: YP_001233018
Protein GI: 148266312
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2876] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase
TIGRFAM ID: [TIGR01361] phospho-2-dehydro-3-deoxyheptonate aldolase
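
The coordinate and length fields in this record can be cross-checked against each other. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon; both assumptions are consistent with the values listed here but are not stated by the record itself. The function and variable names are illustrative only.

```python
# Values copied from the record; names are hypothetical.
start_bp = 4969645
end_bp = 4970673
gene_length_bp = 1029
protein_length_aa = 342


def span_length(start, end):
    """Length of a 1-based, inclusive coordinate range (assumed convention)."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by a CDS whose final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both comparisons should hold if the record is internally consistent.
print(span_length(start_bp, end_bp) == gene_length_bp)
print(expected_protein_length(gene_length_bp) == protein_length_aa)
```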


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000170599
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTTATCG TCATGAGCTA CAATGCCGGA GAGGAAGAGA TCGATGCCGT CGTCAAGGCG 
GTTGAGGAGA TGGGCTATAA GGCCAAGCCG ATTCCCGGCG GTGAACGGAC CGCTATCGGC
GTTCTGGGGA ATACCGGCTA TGTGGACGAT GTGATCATCA GGGACCTGCC CGGTGTGCAG
GAGGTTATCC ATGTCTCCAA ACCCTACAAG CTTGTTTCCC GGGCTTTTCA CCCGCAGAGC
AGCATCATAA ATGTTTGCGG GGTGGAAATC GGCGAGGGGT GCCGACCGGT TGTCGCCGCT
GGTCCATGCG CGGTGGAAAG TGAAGAGCAG ATCGTCAAAA CCGCCCTGGC GGTCAAGGCG
GCCGGCGCCG ATCTGTTACG CGGCGGCGCT TTCAAGCCGA GAACCGGCCC CCATACGTTT
CAGGGGTTAA GAGAAGAAGG TTTGCGACTG TTGGCCCTGG CCGGCAAAGA GAGTGGCCTC
CCCATCGTCA CTGAGGTGAT GAGCCCGGAA AGTGTCGGGA TTGTGGCGGA ATACGCCGAC
CTCCTCCAGG TAGGTGCGCG TAATATGCAG AACTTTGACC TGTTGCGGGA GGTGGGCCGT
ATCGAGAAAC CGGTCCTCCT CAAGCGGGGG ATGAGCGCTA CCATCGAAGA GTTTCTTGCT
GCCGCGGAAT ACATCCTGGC GGAGGGGAAT CCCAACGTCA TCCTTTGCGA GCGCGGCATT
CGCACCTTCG AAACCGCTAC CCGCAATACC CTCGACCTTT CGGTGGTGCC GCTCATCAAG
GAATTGTCCC ATCTGCCCAT CATGGTTGAT CCCTCCCATG CCACCGGCAA ACGAAGCCTT
GTCCCTCCCA TGTCGAAAGC CGCCCTGGTA GCGGGAGCCC ACGGCATTCT CGTTGAGGTT
CATCCGGAAC CGGAGAAAGC GCTTTCCGAT GGTCCGCAAT CTTTGACTTT CCAGGGCTTT
GACAAGCTGA TGGAGGAGGT AAGAAAGCTT AACCAGTTCC TTGGCTACGG CGCTGAAAAA
GACGCTTGA
 
Protein sequence
MLIVMSYNAG EEEIDAVVKA VEEMGYKAKP IPGGERTAIG VLGNTGYVDD VIIRDLPGVQ 
EVIHVSKPYK LVSRAFHPQS SIINVCGVEI GEGCRPVVAA GPCAVESEEQ IVKTALAVKA
AGADLLRGGA FKPRTGPHTF QGLREEGLRL LALAGKESGL PIVTEVMSPE SVGIVAEYAD
LLQVGARNMQ NFDLLREVGR IEKPVLLKRG MSATIEEFLA AAEYILAEGN PNVILCERGI
RTFETATRNT LDLSVVPLIK ELSHLPIMVD PSHATGKRSL VPPMSKAALV AGAHGILVEV
HPEPEKALSD GPQSLTFQGF DKLMEEVRKL NQFLGYGAEK DA
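
The sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how such a block-formatted gene sequence might be cleaned, summarized by GC fraction, and translated is given below. It uses the standard codon assignments, which, as far as I know, translation table 11 shares with the standard code (the tables differ in which codons may act as start codons). The helper names are hypothetical and are not part of any database tool.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard codon assignments).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(dna):
    """Fraction of G and C bases in a cleaned DNA string."""
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 18 bases of the gene sequence listed above.
fragment = clean_sequence("ATGCTTATCG TCATGAGC")
print(translate(fragment))
```

Passing the full gene sequence through `clean_sequence` and then `translate` or `gc_fraction` gives values that can be compared with the protein sequence and GC content fields of the record. An alternative start codon such as GTG or TTG would need special handling as methionine; that does not apply here because the gene starts with ATG.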