Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_1888 |
Symbol | |
ID | 5733777 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 2276722 |
End bp | 2277822 |
Gene Length | 1101 bp |
Protein Length | 366 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641279032 |
Product | 4-hydroxyphenylpyruvate dioxygenase |
Protein accession | YP_001544659 |
Protein GI | 159898412 |
COG category | [E] Amino acid transport and metabolism [R] General function prediction only |
COG ID | [COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins |
TIGRFAM ID | [TIGR01263] 4-hydroxyphenylpyruvate dioxygenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTAGCC CAACATCACT AGACACCACG CCGCAACTCG ACGATTTCGA TTACGTTGAA TTCTATGTTG GAAATGCTCG TCAGACTGCC CACTTTTTAC GCACCGCCTT TGGCTTCAAA CCAATTGCCT ACGCTGGACT CGAAACAGGC GTACGCGATC GCGCCTCGAT TTTGCTGCAA CAAGGTGCGA TTCGCCTGAT CATCACCGAA GCACTCGACC CTGAAAGCCC AATCGCCGAC CACGTAAAAT TGCATGGCGA TAGTATTAAG GATATTGCCT TTACGGTTGC CAATGTGCAT AGCGCCTTCG AGGCCGCCGT TAAACGTGGT GCCCGCCCAA TCTTAGAACC AGTGACAATC GAAAGCCCCC AAGGAAGCAT CATCAAGGCC ACCATTGGCA CCTATGGTGA TACAACCCAC TCATTGATCC AGCGAGTCGA TCTTGCTGAT AATGCTTTCC CCCAATTTCA GCCAATCGAA AATCCGGCTC ACGTAATTGA TGGCGGCTTC TCAGTCGTCG ATCACGTTGC GATTAGCCTA GAGCCAGGTC GCTTGGCCGA ATGGGTCGAT TTCTACATCA ATGTGCTTGG TTTCCATCAA TCGCACGAAG AAAATATTGT TACCGAATAT AGTGGCATGA ACTCGCGGGT CGTCCAGAAC CACGCTGGCA CGATTAAATT CCCTATGCAA GAGCCGATTC AAGGCAAACG GCGTTCGCAA GTTGAAGAAT TCTTGACCTT TCATCATGGA GCCGGAGCCC AGCACTTGGC CATCCTCACT GATGATATTA TTCACTCGAT CCAGACGCTG CGGGCCAACG GAATCGAGTT TGTGCGCACG CCCGCAACCT ACTATGAAAA TCTCCAAGAG CGGGTTGGCT TAATTGATGA AGATATTGCG ATGCTGCGCG ATTTGCATAT TTTGGTAGAT CGCGATAGCT CGGGCTATTT GCTGCAAATT TTCACCAAGC CGCTGCAAAG CCGACCAACG ATGTTCTTTG AAATTATTCA GCGCAAAAAT GCCATTGGCT TCGGCAGTGG CAACATCAAA GCACTCTTTG CCGCCGTCGA ACGCGAGCAA GCTTTGCGGG GAAATCTCTA A
|
Protein sequence | MTSPTSLDTT PQLDDFDYVE FYVGNARQTA HFLRTAFGFK PIAYAGLETG VRDRASILLQ QGAIRLIITE ALDPESPIAD HVKLHGDSIK DIAFTVANVH SAFEAAVKRG ARPILEPVTI ESPQGSIIKA TIGTYGDTTH SLIQRVDLAD NAFPQFQPIE NPAHVIDGGF SVVDHVAISL EPGRLAEWVD FYINVLGFHQ SHEENIVTEY SGMNSRVVQN HAGTIKFPMQ EPIQGKRRSQ VEEFLTFHHG AGAQHLAILT DDIIHSIQTL RANGIEFVRT PATYYENLQE RVGLIDEDIA MLRDLHILVD RDSSGYLLQI FTKPLQSRPT MFFEIIQRKN AIGFGSGNIK ALFAAVEREQ ALRGNL
|
| |