Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | VC0395_A0960 |
Symbol | hppD |
ID | 5136020 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Vibrio cholerae O395 |
Kingdom | Bacteria |
Replicon accession | NC_009457 |
Strand | + |
Start bp | 989932 |
End bp | 991041 |
Gene Length | 1110 bp |
Protein Length | 369 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 640532418 |
Product | 4-hydroxyphenylpyruvate dioxygenase |
Protein accession | YP_001216906 |
Protein GI | 147673060 |
COG category | [E] Amino acid transport and metabolism [R] General function prediction only |
COG ID | [COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins |
TIGRFAM ID | [TIGR01263] 4-hydroxyphenylpyruvate dioxygenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 41 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTTACTT CAACTCAACA CACCAAGGAA ACGATGATGA TGGATAGCGT AAACCCACTC GGTACAGATG GCTTTGAATT TGTGGAATAT ACTGCCGCTG ATGAGCAGGG CATTGCCAGC CTCAAACACC TCTTCACCTC TCTGGGCTTT GCCGAAATTG CTAAACACCG TTCTAAAGAA GCTTGGCTGT ATCGTCAGGG TGACATCAAC TTTATTGTCA ACGCACAGCC TCGTAGCCAA GCTGAAGCGT TTGCTAAACA GCATGGCCCT TCGGTATGCG GGATGGCGTT TCGCGTGAAA GACGCGGCGA TTGCTCTCAA GCATGCGCAG GCCAATGGTG CCGTCGAGTA CAAAACTGAG ATTGGGCCTA TGGAGTTAAG CATTCCGGCG GTGATCGGGA TTGGCGATAG CTTGCTCTAT TTTGTGGATC GTTATGGCGA TCGCAGCATC TATGATGTCG ATTTTCATTT CTACCCTGAT AGCAAAGAGC GCCTTGCCAA AGCGCAAGTG GGGTTGTATG AAATTGACCA CCTCACCCAC AACGTGAAAC GTGGCAACAT GAACCTGTGG GCAGGCTTTT ATGAGCGGAT TGGTAACTTC CGTGAAATTC GCTACTTTGA TATTGAGGGC AAACTGACAG GGTTGGTGAG CCGAGCCATG ACCGCGCCCT GTGGCAAAAT CCGTATTCCG ATCAACGAGT CCTCTGACGA TAAATCGCAA ATCGAAGAGT TTATTCGTGA GTACAAAGGT GAAGGTATCC AGCATATCGC GCTCAGTACC GAGGATATTT ACCACACTGT GAAAACCTTG CGTGAACGTG GCATGGACTT TATGCCCACT CCGGACACCT ATTACGACAA GGTGAATCAG CGAGTGGTGG GACATCAAGA AGATGTGCAA GCACTGCGTG ACTTACGTAT TTTGATTGAT GGTGCACCGA TGAAAGATGG CATTTTGCTG CAAATCTTCA CTCAAACTGT GATTGGGCCT GTGTTCTTTG AAATCATTCA GCGCAAAGGT AATCAAGGAT TTGGTGAAGG TAACTTCAAA GCGCTGTTTG AATCGATTGA AGAAGATCAG ATCCGCCGTG GAGTATTGAC TGATGCATAA
|
Protein sequence | MVTSTQHTKE TMMMDSVNPL GTDGFEFVEY TAADEQGIAS LKHLFTSLGF AEIAKHRSKE AWLYRQGDIN FIVNAQPRSQ AEAFAKQHGP SVCGMAFRVK DAAIALKHAQ ANGAVEYKTE IGPMELSIPA VIGIGDSLLY FVDRYGDRSI YDVDFHFYPD SKERLAKAQV GLYEIDHLTH NVKRGNMNLW AGFYERIGNF REIRYFDIEG KLTGLVSRAM TAPCGKIRIP INESSDDKSQ IEEFIREYKG EGIQHIALST EDIYHTVKTL RERGMDFMPT PDTYYDKVNQ RVVGHQEDVQ ALRDLRILID GAPMKDGILL QIFTQTVIGP VFFEIIQRKG NQGFGEGNFK ALFESIEEDQ IRRGVLTDA
|
| |