Gene VC0395_A0960 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagVC0395_A0960 
SymbolhppD 
ID5136020 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameVibrio cholerae O395 
KingdomBacteria 
Replicon accessionNC_009457 
Strand
Start bp989932 
End bp991041 
Gene Length1110 bp 
Protein Length369 aa 
Translation table11 
GC content48% 
IMG OID640532418 
Product4-hydroxyphenylpyruvate dioxygenase 
Protein accessionYP_001216906 
Protein GI147673060 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins 
TIGRFAM ID[TIGR01263] 4-hydroxyphenylpyruvate dioxygenase 


Plasmid Coverage information

Num covering plasmid clones41 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTTACTT CAACTCAACA CACCAAGGAA ACGATGATGA TGGATAGCGT AAACCCACTC 
GGTACAGATG GCTTTGAATT TGTGGAATAT ACTGCCGCTG ATGAGCAGGG CATTGCCAGC
CTCAAACACC TCTTCACCTC TCTGGGCTTT GCCGAAATTG CTAAACACCG TTCTAAAGAA
GCTTGGCTGT ATCGTCAGGG TGACATCAAC TTTATTGTCA ACGCACAGCC TCGTAGCCAA
GCTGAAGCGT TTGCTAAACA GCATGGCCCT TCGGTATGCG GGATGGCGTT TCGCGTGAAA
GACGCGGCGA TTGCTCTCAA GCATGCGCAG GCCAATGGTG CCGTCGAGTA CAAAACTGAG
ATTGGGCCTA TGGAGTTAAG CATTCCGGCG GTGATCGGGA TTGGCGATAG CTTGCTCTAT
TTTGTGGATC GTTATGGCGA TCGCAGCATC TATGATGTCG ATTTTCATTT CTACCCTGAT
AGCAAAGAGC GCCTTGCCAA AGCGCAAGTG GGGTTGTATG AAATTGACCA CCTCACCCAC
AACGTGAAAC GTGGCAACAT GAACCTGTGG GCAGGCTTTT ATGAGCGGAT TGGTAACTTC
CGTGAAATTC GCTACTTTGA TATTGAGGGC AAACTGACAG GGTTGGTGAG CCGAGCCATG
ACCGCGCCCT GTGGCAAAAT CCGTATTCCG ATCAACGAGT CCTCTGACGA TAAATCGCAA
ATCGAAGAGT TTATTCGTGA GTACAAAGGT GAAGGTATCC AGCATATCGC GCTCAGTACC
GAGGATATTT ACCACACTGT GAAAACCTTG CGTGAACGTG GCATGGACTT TATGCCCACT
CCGGACACCT ATTACGACAA GGTGAATCAG CGAGTGGTGG GACATCAAGA AGATGTGCAA
GCACTGCGTG ACTTACGTAT TTTGATTGAT GGTGCACCGA TGAAAGATGG CATTTTGCTG
CAAATCTTCA CTCAAACTGT GATTGGGCCT GTGTTCTTTG AAATCATTCA GCGCAAAGGT
AATCAAGGAT TTGGTGAAGG TAACTTCAAA GCGCTGTTTG AATCGATTGA AGAAGATCAG
ATCCGCCGTG GAGTATTGAC TGATGCATAA
 
Protein sequence
MVTSTQHTKE TMMMDSVNPL GTDGFEFVEY TAADEQGIAS LKHLFTSLGF AEIAKHRSKE 
AWLYRQGDIN FIVNAQPRSQ AEAFAKQHGP SVCGMAFRVK DAAIALKHAQ ANGAVEYKTE
IGPMELSIPA VIGIGDSLLY FVDRYGDRSI YDVDFHFYPD SKERLAKAQV GLYEIDHLTH
NVKRGNMNLW AGFYERIGNF REIRYFDIEG KLTGLVSRAM TAPCGKIRIP INESSDDKSQ
IEEFIREYKG EGIQHIALST EDIYHTVKTL RERGMDFMPT PDTYYDKVNQ RVVGHQEDVQ
ALRDLRILID GAPMKDGILL QIFTQTVIGP VFFEIIQRKG NQGFGEGNFK ALFESIEEDQ
IRRGVLTDA