Gene Avin_38100 details


Gene Information

Locus tag: Avin_38100
Symbol: hppD
ID: 7762701
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Azotobacter vinelandii DJ
Kingdom: Bacteria
Replicon accession: NC_012560
Strand:
Start bp: 3854554
End bp: 3856452
Gene Length: 1899 bp
Protein Length: 632 aa
Translation table: 11
GC content: 68%
IMG OID: 643806675
Product: 4-hydroxyphenylpyruvate dioxygenase
Protein accession: YP_002800927
Protein GI: 226945854
COG category: [E] Amino acid transport and metabolism; [R] General function prediction only
COG ID: [COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins
TIGRFAM ID: [TIGR01263] 4-hydroxyphenylpyruvate dioxygenase
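
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming Start bp and End bp are 1-based inclusive coordinates and that the gene ends in a single stop codon that is not counted in Protein Length; the function names are hypothetical and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids, assuming one trailing stop codon is excluded."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record above.
length_bp = gene_length(3854554, 3856452)
print(length_bp)                  # 1899
print(protein_length(length_bp))  # 632
```

Both results agree with the Gene Length and Protein Length fields of the record.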


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.749235
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCATCGTT CCATTGCCAC CGTCTCGCTC AGCGGAACCC TGCCGGAAAA GCTCGAGGCG 
GTCGCCGCCG CCGGCTTCGA TGGCGTGGAA ATCTTCGAGA ACGACTTGCT CTACCACGGC
GCCAGCCCGC GCGAGGTACG CCGGATGGCG GCCGACCTGG GCCTGGCCAT CACGTTGTTC
CAGCCGTTTC GCGACTTCGA GGGCTGCCGG CGCGACCGGC TGCAGCGCAA CCTCGACCGC
GCCGAGCGCA AGTTCGACCT GATGCAGGAA CTGGGCACCG ACCTGGTGCT GGTGTGCAGC
AACGTCGCCG GGGATTCCCT GGGCGACCGG CAGATCCTCG TCGACGACCT GCGGCTGCTG
GCCGAGCGGG CCGCCGCGCG CCGCCTGCGC ATCGGCTACG AAGCGCTGGC CTGGGGGCGC
CACGTCAACA CCTGGCAACA GGTCTGGGAC ATCGTGCGGG AAGCCGACCA TCCCGATCTG
GGCATGATCC TCGACAGCTT TCACACCCTG TCGCTGAAAG GCGATCCGAG TGCCATCGCC
GAGGTGCCCG GAGAAAAGAT CTTCTTCGTG CAGATGGCCG ATGCGCCGAT CCTGGCCATG
GATGTGCTGG AATGGAGCCG GCATTTCCGC TGTTTCCCGG GGCAGGGCGA GTTCGACCTG
GCGGGCTTTC TCGCGCCGAT CCTCAAGAGC GGCTACCGCG GGCCGCTGTC GCTGGAAATC
TTCAACGACG GCTTCCGCGC CGCGCCGACC CGCGGCAACG CGCTGGACGG CTACCGCTCG
CTGCTCTACC TGGAGGAGAA GACCCGCCTG CTGCTGGAGC GCCAGGGACG GCCGCCAGAG
CCCGGGGTGC TGTTCGCGCC GCCGCCGGCC AGTCGCTACG ACGGCATCGA GTTCCTCGAG
TTCGCCGTCG ACGACGACCA TGCCGCCCGA CTGGGCCAGT GGCTGACCCG TCTCGGTTTC
GTCGAGGCCG GCCGCCACCG CTCGAAGAAC GTCAGCCTGC TGCGCCAGGG CGACATCAAC
CTGGTGCTCA ACGCCGAGCC CTATTCCTTC GCCCACGGCT ATTTCGAGGC CCATGGCCCG
TCGCTGTGCG CCACCGCGCT GCGGGTGCAG GACAGCCATC GAGCGCTGGA GCGCGCCTGC
GGCTTCGGCG GACAACCCTA CCGTGGCCTG GTCGGTCCCA ACGAGCGCGA GATTCCGGCG
GTGCGCGCGC CGGACAGCAG CCTGATCTAT CTGGTCGACC GCGATGCCGA GGGCCACAGC
ATCTACGAGA CCGATTTCGA GCTGAAGGGC GGCACCTGCG CCGGCGCGCT GCAGCGCATC
GACCACGTGG CCATGGCCCT GCCGGCCGAG GCCATGGATT CCTGGGTGCT GTTCTACAAG
AGCCTGTTCG ACTTCGAGGC CGACGACGAA GTGGTGCTGC CCGATCCCTA CGGTCTGGTC
AAGAGCCGCG CCGTGCGCAG CCGCTGCTGC TCGGTGCGCC TGCCGCTGAA CATCTCGGAG
AACCGCAACA CGGCCATCTC GCGCTCGCTG TCGAGCTATC GCGGCTCCGG CGTGCACCAC
ATCGCTTTCT CCTGCGACGA CATCTTCGTC GCCGTCGAGC TGGCCAAGGA GGCCGGCGTG
CCGCTGCTGG AGATTCCGCT GAACTACTAC GACGACCTGG CCGCGCGCTT CGATTTCGAC
GACGACTTCC TCAGCAGGCT GGCCTACTTC AACATCCTCT ACGACCGCGA TGCGCAGGGC
GGCGAGCTGT TCCACGTCTA TACCGAGCCG TTCGCCGAGC GTTTCTTCTT CGAGATCCTG
CAGCGCAGGG ACTACGCCGG CTACGGGGCG GCCAACGTCG CCGTGCGCCT GGCGGCCATG
GCCCAGGCGC GCGACGGGCA CGGCAAGCCC AGGCTGTGA
 
Protein sequence
MHRSIATVSL SGTLPEKLEA VAAAGFDGVE IFENDLLYHG ASPREVRRMA ADLGLAITLF 
QPFRDFEGCR RDRLQRNLDR AERKFDLMQE LGTDLVLVCS NVAGDSLGDR QILVDDLRLL
AERAAARRLR IGYEALAWGR HVNTWQQVWD IVREADHPDL GMILDSFHTL SLKGDPSAIA
EVPGEKIFFV QMADAPILAM DVLEWSRHFR CFPGQGEFDL AGFLAPILKS GYRGPLSLEI
FNDGFRAAPT RGNALDGYRS LLYLEEKTRL LLERQGRPPE PGVLFAPPPA SRYDGIEFLE
FAVDDDHAAR LGQWLTRLGF VEAGRHRSKN VSLLRQGDIN LVLNAEPYSF AHGYFEAHGP
SLCATALRVQ DSHRALERAC GFGGQPYRGL VGPNEREIPA VRAPDSSLIY LVDRDAEGHS
IYETDFELKG GTCAGALQRI DHVAMALPAE AMDSWVLFYK SLFDFEADDE VVLPDPYGLV
KSRAVRSRCC SVRLPLNISE NRNTAISRSL SSYRGSGVHH IAFSCDDIFV AVELAKEAGV
PLLEIPLNYY DDLAARFDFD DDFLSRLAYF NILYDRDAQG GELFHVYTEP FAERFFFEIL
QRRDYAGYGA ANVAVRLAAM AQARDGHGKP RL
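
The sequences above are listed in space-separated blocks of ten characters. The following is a minimal sketch of how such a listing could be read back, its GC fraction computed, and the gene translated. It assumes the amino-acid assignments of translation table 11 match the standard code (the two tables differ in their permitted start codons, which this sketch does not handle); the helper names are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (first, second, third base).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence listing above.
first_line = "ATGCATCGTT CCATTGCCAC CGTCTCGCTC AGCGGAACCC TGCCGGAAAA GCTCGAGGCG"
dna = clean_sequence(first_line)
print(translate(dna))  # MHRSIATVSLSGTLPEKLEA
```

The translated fragment matches the first twenty residues of the protein sequence listed above. Passing the full gene listing through `clean_sequence` and `gc_fraction` would give a value to compare against the GC content field.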