Gene Ava_4649 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_4649 
Symbol 
ID3679864 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp5812759 
End bp5813817 
Gene Length1059 bp 
Protein Length352 aa 
Translation table11 
GC content46% 
IMG OID637720005 
Product3-deoxy-7-phosphoheptulonate synthase 
Protein accessionYP_325141 
Protein GI75910845 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2876] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase 
TIGRFAM ID[TIGR01361] phospho-2-dehydro-3-deoxyheptonate aldolase 


Plasmid Coverage information

Num covering plasmid clones37 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.376747 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATAGTAG TCATGAAAGT CGGTTCCCCA GAAGTGGAAA TCAACCGGAT TAATGATGAA 
TTAACCAGTT GGGGGCTGAC ACCAGAAAAA ATTATCGGCA AACACAAAGT AGTAATTGGC
TTAGTAGGTG AAACCGCCGA TTTAGACCCC TTACAAATCC AAGAAGTTAG CCCTTGGATT
GAGCAAGTAT TACGGGTAGA ACTGCCTTAT AAACGCGCTA GCCGCCAATA TCGTCACGGT
GAAGCCTCGG AAGTAGTTGT TAATACTCCA GATGGTCCAG TCGTATTTGG TGAAAACCAA
GCTTTAGTGG TGGTTGCTGG TCCCTGTTCC GTCGAAAATG AGGAAATGAT TATTGAGACG
GCGCAGCGCG TCAAGGCAGC CGGCGCTAAG TTTTTACGCG GTGGTGCATA TAAGCCCCGT
ACTTCACCTT ACGCTTTTCA AGGTCACGGC GAAAGTGCTT TGGAATTGTT GGCAAAGGCG
CGGGATGTTA GCGGCTTAGG TATCATCACC GAAGTGATGG ACGCGTCGGA ACTGGATATT
ATTGCAGAAG TGGCTGATGT TATCCAGGTA GGTGCAAGAA ATATGCAGAA TTTCTCCCTA
CTGAAAAAAG TAGGGGCGCA GCCAAAACCA GTATTACTGA AGCGAGGGAT GGCAGCTACT
ATCGAAGATT GGTTGATGGC AGCCGAGTAC GTTCTAGCAG CAGGTAACCC TAATGTAATT
TTATGTGAAC GAGGTATTCG TACTTTTGAC CGTCAATATA CACGTAACAC TTTAGATTTG
TCGGTAGTTC CAGTATTGAG GAAATTAACT CACCTACCAA TTATGATTGA TCCCAGTCAT
GGTACTGGTT GGGCTGAGTA TGTACCATCA ATGGCGATGG CAGCGATCGC AGCTGGTACT
GATTCTTTGA TGATTGAAGT ACACCCCAAT CCCAAAAAAG CCCTATCCGA CGGGCCGCAA
TCCCTCACAC CAGAACATTT TGACCGCTTA ATGCAGGAAT TAGCAGTCAT TGGTAAAGCT
GTAGGACGCT GGCCACAACC AGCAGTTGTA ACTGCATAA
 
Protein sequence
MIVVMKVGSP EVEINRINDE LTSWGLTPEK IIGKHKVVIG LVGETADLDP LQIQEVSPWI 
EQVLRVELPY KRASRQYRHG EASEVVVNTP DGPVVFGENQ ALVVVAGPCS VENEEMIIET
AQRVKAAGAK FLRGGAYKPR TSPYAFQGHG ESALELLAKA RDVSGLGIIT EVMDASELDI
IAEVADVIQV GARNMQNFSL LKKVGAQPKP VLLKRGMAAT IEDWLMAAEY VLAAGNPNVI
LCERGIRTFD RQYTRNTLDL SVVPVLRKLT HLPIMIDPSH GTGWAEYVPS MAMAAIAAGT
DSLMIEVHPN PKKALSDGPQ SLTPEHFDRL MQELAVIGKA VGRWPQPAVV TA