Gene Ava_3980 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_3980 
Symbol 
ID3679654 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp4933158 
End bp4934234 
Gene Length1077 bp 
Protein Length358 aa 
Translation table11 
GC content42% 
IMG OID637719332 
Product3-deoxy-7-phosphoheptulonate synthase 
Protein accessionYP_324480 
Protein GI75910184 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2876] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase 
TIGRFAM ID[TIGR01361] phospho-2-dehydro-3-deoxyheptonate aldolase 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.005496 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAACCCT TCACTCCATC TGCGGAAATA GAACAAGTAA ATGAAGAAAT TCGTCGCCAT 
GAACTTACAC CAGAAATCTG TGTAGGTCAT CACAAAGTAG TAATCGGTTT GGTGGGAGAT
ACCTCAGAAC TTGATCCACG CCAAATTCAA AATTTGAACC CTTTTATAGA ACAAGTGATT
AGGATTAAAA AGCCTTTCAA AAGAGCTTCT TTAGAATTTC GTTACGGAGA ACATAGTGAG
GTTGTAGTAC CAACACCTAA TGGGCCTATA ACTTTTGGGC AAAACCATCC TGTAGTTGTA
GTTGCCGGCC CTTGTTCGGT AGAAAACGAA GAAATGATCG TCGAGACAGC CCAAGCGGTG
AAAGAATACG GCGCACAATT TTTACGTGGT GGCGCATATA AACCCCGTAC ATCACCTTAT
GCTTTCCAAG GTCATGGTGA GAGCGCTCTA GCTTTGTTAG CAAAAGCTAA AGAAGCTACA
GGTTTAGGAA TCATTACAGA AATCATGGAC GCAGATGACT TAGATAAGCT CATCGAAGTA
GCAGATGTGT TGCAAATTGG GGCCCGAAAT ATGCAGAATT TTTCACTTCT CAAAAAAATA
GGAGCTACTA CTAAGCCAGT GCTACTGAAG CGGGGACTGT CAGCCACTAT CGAAGATTGG
TTGATGTCAG CAGAGTATAT TTTAGCCGGA GGTAATCCAA ACGTCATACT ATGTGAGCGA
GGAATTCGCA CTTTTGACCG ACAGTTTACA AGAAATACGC TGGATATATC TGCAATTCCA
GTTTTGCGAA CCCTGACACA CTTACCTATT ATGATTGATC CTAGTCATGC CACCGGTAAA
TCCGAATTTG TGCCAACTAT GACAATGGCA TCTATAGCTG CTGGTGCAGA TTCCTTGATG
ATTGAAGTTC ACCCCAATCC AGCTAAAGCA CTTTCAGATG GGCCTCAATC ACTCACATTG
GAAGGATTTA AGGAATTAAT GCACGAAATA ACTGCCCTAA GTAAATTCTT TGGTCGTTCG
CCTCACAACA ATAGTACAAA TCTAAAGCCA AAAGCTATCT CATTGGTGAG CTATTAA
 
Protein sequence
MKPFTPSAEI EQVNEEIRRH ELTPEICVGH HKVVIGLVGD TSELDPRQIQ NLNPFIEQVI 
RIKKPFKRAS LEFRYGEHSE VVVPTPNGPI TFGQNHPVVV VAGPCSVENE EMIVETAQAV
KEYGAQFLRG GAYKPRTSPY AFQGHGESAL ALLAKAKEAT GLGIITEIMD ADDLDKLIEV
ADVLQIGARN MQNFSLLKKI GATTKPVLLK RGLSATIEDW LMSAEYILAG GNPNVILCER
GIRTFDRQFT RNTLDISAIP VLRTLTHLPI MIDPSHATGK SEFVPTMTMA SIAAGADSLM
IEVHPNPAKA LSDGPQSLTL EGFKELMHEI TALSKFFGRS PHNNSTNLKP KAISLVSY