Gene Ava_4666 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_4666 
Symbol 
ID3679822 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp5829058 
End bp5830092 
Gene Length1035 bp 
Protein Length344 aa 
Translation table11 
GC content43% 
IMG OID637720022 
Product4-hydroxyphenylpyruvate dioxygenase 
Protein accessionYP_325158 
Protein GI75910862 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins 
TIGRFAM ID[TIGR01263] 4-hydroxyphenylpyruvate dioxygenase 


Plasmid Coverage information

Num covering plasmid clones35 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAATTG ATCATGTTCA TTTCTATGTC GAAGATGCCA AGGTGTGGCG GGATTGGTTC 
CTCAAATATT TGGGCTTTAC CGCAGTAACT AGTAATATCA GTTCTCTACA CACCTGTACA
GAGGTAGTGA AAAGTGGAGA TGTCTGCTTT TTGCTATCTT CTCCATTATT GCCTACTAGT
CCCGTAGCGG AATTTCTGCG TCAACATCCT CCTGGTGTGG CAGATGTGGC TTTTGCGGTA
AAAGATGTAG AAACAGCGAT CGCTCATGCC GAGGCTCACG GTGCTACAAT CCTACAATCC
ATAGGGGAAC GTCGCATTGG TAATATCTCC CGCAAGTGTG GCAAAATTGC GGCTTGGGGT
GGCTTAACTC ATACGTTAAT CGAAAAATTA AGTCCAGATA GCCAAATAAT TGCATCACCT
AATTTTATTA CCGCTATAGA CCATATAGTC TTAAACGTCG CCATAGGTGA GTTAGAAGCG
GCTGTAGCTT GGTATGAGAA AATTCTCGAT TTGCAACCCC GACAAGCTTT TAAAATCCAA
ACCGATCGCT CGGCGCTGCA CAGCCAAGTC ATGGTTTCCC GTGATGGTAG TGTACAATTG
CCAATTAACG AGCCAGCATC CAGCAATTCC CAAATACAAG AATTTCTCAA CTTTAACCGA
GGAGCAGGTA TTCAACATAT TGCCTTGCAA ACACAAAATA TTGTCGATGC GATCGCCCAA
TTTCGTAACG GTGGTTTACC ATTGCTTTCA GTTCCGCAAA CTTATTACAC ACAACTCAAA
CAGCGTCTAG AAATCCCTCT CTCATCTACA GAATTAGAGG CGATCGCTCA ACAAGAAATT
CTGGTAGACT GGCGAAAAGA TAATCAAAAT GGGGTATTAT TGCAAATTTT CAGCCAACCC
ATATTTGCAG AACCAACTTT TTTCTTAGAA TTTATTGAAC GTCGTTCCCA AGCACAAGGC
TTCGGTGAAG GTAACTTCCG CGCCTTATTT GAAGCCATCG AAAGCGAACA AATGAAGCGG
GGAACTCTTC AATAA
 
Protein sequence
MKIDHVHFYV EDAKVWRDWF LKYLGFTAVT SNISSLHTCT EVVKSGDVCF LLSSPLLPTS 
PVAEFLRQHP PGVADVAFAV KDVETAIAHA EAHGATILQS IGERRIGNIS RKCGKIAAWG
GLTHTLIEKL SPDSQIIASP NFITAIDHIV LNVAIGELEA AVAWYEKILD LQPRQAFKIQ
TDRSALHSQV MVSRDGSVQL PINEPASSNS QIQEFLNFNR GAGIQHIALQ TQNIVDAIAQ
FRNGGLPLLS VPQTYYTQLK QRLEIPLSST ELEAIAQQEI LVDWRKDNQN GVLLQIFSQP
IFAEPTFFLE FIERRSQAQG FGEGNFRALF EAIESEQMKR GTLQ