Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ava_4666 |
Symbol | |
ID | 3679822 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anabaena variabilis ATCC 29413 |
Kingdom | Bacteria |
Replicon accession | NC_007413 |
Strand | - |
Start bp | 5829058 |
End bp | 5830092 |
Gene Length | 1035 bp |
Protein Length | 344 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 637720022 |
Product | 4-hydroxyphenylpyruvate dioxygenase |
Protein accession | YP_325158 |
Protein GI | 75910862 |
COG category | [E] Amino acid transport and metabolism [R] General function prediction only |
COG ID | [COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins |
TIGRFAM ID | [TIGR01263] 4-hydroxyphenylpyruvate dioxygenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 35 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAATTG ATCATGTTCA TTTCTATGTC GAAGATGCCA AGGTGTGGCG GGATTGGTTC CTCAAATATT TGGGCTTTAC CGCAGTAACT AGTAATATCA GTTCTCTACA CACCTGTACA GAGGTAGTGA AAAGTGGAGA TGTCTGCTTT TTGCTATCTT CTCCATTATT GCCTACTAGT CCCGTAGCGG AATTTCTGCG TCAACATCCT CCTGGTGTGG CAGATGTGGC TTTTGCGGTA AAAGATGTAG AAACAGCGAT CGCTCATGCC GAGGCTCACG GTGCTACAAT CCTACAATCC ATAGGGGAAC GTCGCATTGG TAATATCTCC CGCAAGTGTG GCAAAATTGC GGCTTGGGGT GGCTTAACTC ATACGTTAAT CGAAAAATTA AGTCCAGATA GCCAAATAAT TGCATCACCT AATTTTATTA CCGCTATAGA CCATATAGTC TTAAACGTCG CCATAGGTGA GTTAGAAGCG GCTGTAGCTT GGTATGAGAA AATTCTCGAT TTGCAACCCC GACAAGCTTT TAAAATCCAA ACCGATCGCT CGGCGCTGCA CAGCCAAGTC ATGGTTTCCC GTGATGGTAG TGTACAATTG CCAATTAACG AGCCAGCATC CAGCAATTCC CAAATACAAG AATTTCTCAA CTTTAACCGA GGAGCAGGTA TTCAACATAT TGCCTTGCAA ACACAAAATA TTGTCGATGC GATCGCCCAA TTTCGTAACG GTGGTTTACC ATTGCTTTCA GTTCCGCAAA CTTATTACAC ACAACTCAAA CAGCGTCTAG AAATCCCTCT CTCATCTACA GAATTAGAGG CGATCGCTCA ACAAGAAATT CTGGTAGACT GGCGAAAAGA TAATCAAAAT GGGGTATTAT TGCAAATTTT CAGCCAACCC ATATTTGCAG AACCAACTTT TTTCTTAGAA TTTATTGAAC GTCGTTCCCA AGCACAAGGC TTCGGTGAAG GTAACTTCCG CGCCTTATTT GAAGCCATCG AAAGCGAACA AATGAAGCGG GGAACTCTTC AATAA
|
Protein sequence | MKIDHVHFYV EDAKVWRDWF LKYLGFTAVT SNISSLHTCT EVVKSGDVCF LLSSPLLPTS PVAEFLRQHP PGVADVAFAV KDVETAIAHA EAHGATILQS IGERRIGNIS RKCGKIAAWG GLTHTLIEKL SPDSQIIASP NFITAIDHIV LNVAIGELEA AVAWYEKILD LQPRQAFKIQ TDRSALHSQV MVSRDGSVQL PINEPASSNS QIQEFLNFNR GAGIQHIALQ TQNIVDAIAQ FRNGGLPLLS VPQTYYTQLK QRLEIPLSST ELEAIAQQEI LVDWRKDNQN GVLLQIFSQP IFAEPTFFLE FIERRSQAQG FGEGNFRALF EAIESEQMKR GTLQ
|
| |