Gene Information |
Locus tag | Avin_38100 |
Symbol | hppD |
ID | 7762701 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | - |
Start bp | 3854554 |
End bp | 3856452 |
Gene Length | 1899 bp |
Protein Length | 632 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643806675 |
Product | 4-hydroxyphenylpyruvate dioxygenase |
Protein accession | YP_002800927 |
Protein GI | 226945854 |
COG category | [E] Amino acid transport and metabolism; [R] General function prediction only |
COG ID | [COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins |
TIGRFAM ID | [TIGR01263] 4-hydroxyphenylpyruvate dioxygenase |
|
|
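The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the CDS length includes the stop codon; the function names are illustrative and not part of the source record.

```python
# Values copied from the Gene Information table.
START_BP = 3854554
END_BP = 3856452


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumption)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumption)."""
    codons, remainder = divmod(cds_length_bp, 3)
    if remainder != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # the stop codon does not encode a residue


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1899
print(protein_length(length_bp))  # 632
```

Under these assumptions the results agree with the listed Gene Length (1899 bp) and Protein Length (632 aa).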
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.749235 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCATCGTT CCATTGCCAC CGTCTCGCTC AGCGGAACCC TGCCGGAAAA GCTCGAGGCG GTCGCCGCCG CCGGCTTCGA TGGCGTGGAA ATCTTCGAGA ACGACTTGCT CTACCACGGC GCCAGCCCGC GCGAGGTACG CCGGATGGCG GCCGACCTGG GCCTGGCCAT CACGTTGTTC CAGCCGTTTC GCGACTTCGA GGGCTGCCGG CGCGACCGGC TGCAGCGCAA CCTCGACCGC GCCGAGCGCA AGTTCGACCT GATGCAGGAA CTGGGCACCG ACCTGGTGCT GGTGTGCAGC AACGTCGCCG GGGATTCCCT GGGCGACCGG CAGATCCTCG TCGACGACCT GCGGCTGCTG GCCGAGCGGG CCGCCGCGCG CCGCCTGCGC ATCGGCTACG AAGCGCTGGC CTGGGGGCGC CACGTCAACA CCTGGCAACA GGTCTGGGAC ATCGTGCGGG AAGCCGACCA TCCCGATCTG GGCATGATCC TCGACAGCTT TCACACCCTG TCGCTGAAAG GCGATCCGAG TGCCATCGCC GAGGTGCCCG GAGAAAAGAT CTTCTTCGTG CAGATGGCCG ATGCGCCGAT CCTGGCCATG GATGTGCTGG AATGGAGCCG GCATTTCCGC TGTTTCCCGG GGCAGGGCGA GTTCGACCTG GCGGGCTTTC TCGCGCCGAT CCTCAAGAGC GGCTACCGCG GGCCGCTGTC GCTGGAAATC TTCAACGACG GCTTCCGCGC CGCGCCGACC CGCGGCAACG CGCTGGACGG CTACCGCTCG CTGCTCTACC TGGAGGAGAA GACCCGCCTG CTGCTGGAGC GCCAGGGACG GCCGCCAGAG CCCGGGGTGC TGTTCGCGCC GCCGCCGGCC AGTCGCTACG ACGGCATCGA GTTCCTCGAG TTCGCCGTCG ACGACGACCA TGCCGCCCGA CTGGGCCAGT GGCTGACCCG TCTCGGTTTC GTCGAGGCCG GCCGCCACCG CTCGAAGAAC GTCAGCCTGC TGCGCCAGGG CGACATCAAC CTGGTGCTCA ACGCCGAGCC CTATTCCTTC GCCCACGGCT ATTTCGAGGC CCATGGCCCG TCGCTGTGCG CCACCGCGCT GCGGGTGCAG GACAGCCATC GAGCGCTGGA GCGCGCCTGC GGCTTCGGCG GACAACCCTA CCGTGGCCTG GTCGGTCCCA ACGAGCGCGA GATTCCGGCG GTGCGCGCGC CGGACAGCAG CCTGATCTAT CTGGTCGACC GCGATGCCGA GGGCCACAGC ATCTACGAGA CCGATTTCGA GCTGAAGGGC GGCACCTGCG CCGGCGCGCT GCAGCGCATC GACCACGTGG CCATGGCCCT GCCGGCCGAG GCCATGGATT CCTGGGTGCT GTTCTACAAG AGCCTGTTCG ACTTCGAGGC CGACGACGAA GTGGTGCTGC CCGATCCCTA CGGTCTGGTC AAGAGCCGCG CCGTGCGCAG CCGCTGCTGC TCGGTGCGCC TGCCGCTGAA CATCTCGGAG AACCGCAACA CGGCCATCTC GCGCTCGCTG TCGAGCTATC GCGGCTCCGG CGTGCACCAC ATCGCTTTCT CCTGCGACGA CATCTTCGTC GCCGTCGAGC TGGCCAAGGA GGCCGGCGTG CCGCTGCTGG AGATTCCGCT GAACTACTAC GACGACCTGG CCGCGCGCTT CGATTTCGAC GACGACTTCC TCAGCAGGCT GGCCTACTTC AACATCCTCT ACGACCGCGA TGCGCAGGGC GGCGAGCTGT TCCACGTCTA TACCGAGCCG TTCGCCGAGC GTTTCTTCTT CGAGATCCTG 
CAGCGCAGGG ACTACGCCGG CTACGGGGCG GCCAACGTCG CCGTGCGCCT GGCGGCCATG GCCCAGGCGC GCGACGGGCA CGGCAAGCCC AGGCTGTGA
|
Protein sequence | MHRSIATVSL SGTLPEKLEA VAAAGFDGVE IFENDLLYHG ASPREVRRMA ADLGLAITLF QPFRDFEGCR RDRLQRNLDR AERKFDLMQE LGTDLVLVCS NVAGDSLGDR QILVDDLRLL AERAAARRLR IGYEALAWGR HVNTWQQVWD IVREADHPDL GMILDSFHTL SLKGDPSAIA EVPGEKIFFV QMADAPILAM DVLEWSRHFR CFPGQGEFDL AGFLAPILKS GYRGPLSLEI FNDGFRAAPT RGNALDGYRS LLYLEEKTRL LLERQGRPPE PGVLFAPPPA SRYDGIEFLE FAVDDDHAAR LGQWLTRLGF VEAGRHRSKN VSLLRQGDIN LVLNAEPYSF AHGYFEAHGP SLCATALRVQ DSHRALERAC GFGGQPYRGL VGPNEREIPA VRAPDSSLIY LVDRDAEGHS IYETDFELKG GTCAGALQRI DHVAMALPAE AMDSWVLFYK SLFDFEADDE VVLPDPYGLV KSRAVRSRCC SVRLPLNISE NRNTAISRSL SSYRGSGVHH IAFSCDDIFV AVELAKEAGV PLLEIPLNYY DDLAARFDFD DDFLSRLAYF NILYDRDAQG GELFHVYTEP FAERFFFEIL QRRDYAGYGA ANVAVRLAAM AQARDGHGKP RL
|
| |
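The gene sequence is listed in space-separated blocks of ten bases and, since it begins with ATG and ends with TGA, appears to be given in coding orientation already. The sketch below shows one way such a sequence could be cleaned, its GC fraction computed, and the reading frame translated. It assumes an input containing only A, C, G and T, and it uses the standard codon assignments, which translation table 11 shares for amino acids; alternative start codons permitted by table 11 are not handled. All names are illustrative.

```python
# Compact codon table: codons enumerated in T, C, A, G order for each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three blocks and last nine bases of the gene sequence listed above.
print(translate("ATGCATCGTT CCATTGCCAC CGTCTCGCTC"))  # MHRSIATVSL
print(translate("AGGCTGTGA"))                         # RL
print(gc_fraction("ATGCATCGTT"))                      # 0.4
```

The two translated fragments match the first ten and last two residues of the listed protein sequence. Running the same functions on the full gene sequence would allow the GC content and protein fields to be compared against the record.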