Gene Avin_50170 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_50170 
SymbolhppD 
ID7763868 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp5085206 
End bp5086243 
Gene Length1038 bp 
Protein Length345 aa 
Translation table11 
GC content62% 
IMG OID643807848 
Product4-hydroxyphenylpyruvate dioxygenase 
Protein accessionYP_002802082 
Protein GI226947009 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins 
TIGRFAM ID[TIGR01263] 4-hydroxyphenylpyruvate dioxygenase 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGACA AGAACAATCC CATCGGCCTG CGCGGCATCG AGTTCACCGA ATTCACCGGC 
GGGACGCTCG AACAGCTCGA CGCCCTGTTC TGGGCCTTCG GCTTCTCGAA GAAGTATCGT
CACCCGAAGC TGGACATCAG CATCTACAAC CAGAACGGCA TCAACTTCCT GCTGAACGGC
GAGCGTGAAG GCTTCTCCGG TCATTTCGCC AAACTGCACG GCCCGTCGAT CAGCTCCATG
GGCTGGCGTG TCGACGATGC CGCATTCGCC AGACAGGAAG CCGTCCGTCG CGGCGCGCGC
GCGGCCGATC CCAAGGACTG CGACCTGCCC TACCCGGCCA TCTACGGCAT CGGCGACAGC
CTGATCTATT TCATCGAGCG TTTCGGCGCC AGGGGCTCGA TCTACGCAAC CGACTTCGTT
CCCCACGAGC AGGCCCGGCT CCAGCCGGAC AAAGGCTTCC TGGAGATCGA CCACCTGACC
AACAACGTGC CGCAAGGCCA GATGGAGCAG TGGGGGGCCT TCTACAAGGA GATCTTCGGT
TTCACCGAAG TGCGCTACTT CGACATCAAG GGCGTGAAGA CCGGCCTGAC CAGCTATGCG
CTGCGCTCGC CGGACGGCAG CTTCTGCATC CCGATCAACC AGCCGAAGGA CGACAAGGAC
CAGATCTCCG AGTACCTGGC CGAGTACAAC GGCCCGGGCG TACAGCACCT GGCCTTCAGC
ACCAACGACA TCCTCGCCTC GCTGGACGCC ATGAAGGGCG GCCCGATCGA GATGCTCGAC
ATCGACGCGA ACTACTACGA CAACGTGTTC CAGCGCCTGC CCAACGTGCG CGAGGACAAG
GAGCGCATCC GCGCCCACCA CGTACTGGTC GACGGCGACC AGGACGGCTA TCTGCTACAG
ATATTCACCA AGAACATCAT CGGCCCGATC TTCGTCGAGA TCATCCAGCG CGAGAACAAC
CTGAGCTTCG GCGAAGGCAA CTTCGGCGCC CTGTTCCGCT CCATCGAGAA GGACCAGGAA
CGTCGCGGCG TAATCTGA
 
Protein sequence
MSDKNNPIGL RGIEFTEFTG GTLEQLDALF WAFGFSKKYR HPKLDISIYN QNGINFLLNG 
EREGFSGHFA KLHGPSISSM GWRVDDAAFA RQEAVRRGAR AADPKDCDLP YPAIYGIGDS
LIYFIERFGA RGSIYATDFV PHEQARLQPD KGFLEIDHLT NNVPQGQMEQ WGAFYKEIFG
FTEVRYFDIK GVKTGLTSYA LRSPDGSFCI PINQPKDDKD QISEYLAEYN GPGVQHLAFS
TNDILASLDA MKGGPIEMLD IDANYYDNVF QRLPNVREDK ERIRAHHVLV DGDQDGYLLQ
IFTKNIIGPI FVEIIQRENN LSFGEGNFGA LFRSIEKDQE RRGVI