Gene PSPTO_3551 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPSPTO_3551 
SymbolhmgA 
ID1185216 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePseudomonas syringae pv. tomato str. DC3000 
KingdomBacteria 
Replicon accessionNC_004578 
Strand
Start bp4007204 
End bp4008508 
Gene Length1305 bp 
Protein Length434 aa 
Translation table11 
GC content60% 
IMG OID637394904 
Producthomogentisate 1,2-dioxygenase 
Protein accessionNP_793331 
Protein GI28870712 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3508] Homogentisate 1,2-dioxygenase 
TIGRFAM ID[TIGR01015] homogentisate 1,2-dioxygenase 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.518718 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTATTC ATTCCAGCTT CGAAGCGTTG GCCTATCAAT CGGGCTTCGC CAACCAGTTC 
AGCAGCGAGG CGCTGCCTGG CGCCTTGCCC ATGGGCCAGA ACTCGCCGCA GAAACACCCC
CTGGGCCTGT ATGCCGAGCA GTTCTCAGGG ACCGCTTTCA CCGTCGCCCG CAACGAGGCC
CGACGGACCT GGTTGTACCG CATCAAACCG TCGGCGGCAC ATCCGCGCTA TCAACGCATG
GACCGGCAAA TCACTGGCCG TGAACAAGGC CCGATCAACC CCAATCGACT GCGCTGGAAC
GCGTTCGATA TCCCGGCCGA GCCCGTCGAT TTCATTGACG GGCTGATTGC GCTGGCCAGC
ACTTCAGCGG CAGATCAGGC CGAGGGCGTC AGCGTTTATG TGTACGCAGC CAATACGTCG
ATGCAGCGCG CGTTTTTCAG CGCTGACGGG GAATGGTTGA TCGTGCCGCA ACAGGGGCGA
CTCAGAATCA TCACCGAGAT GGGTGTGCTG GACATCGGCC CGCTGGAAAT CGCCGTGCTG
CCACGCGGCC TGAAATTCAG CGTGCAACTG CTCGACGGCA GCGCTCGGGG TTACCTCTGC
GAGAACCACG GTGGTGTCTT GCGCCTGCCG GAACTGGGGC CGATCGGTAG CAATGGTCTG
GCCAACCCGC GTGATTTTCT GACGCCCGTG GCCTGGTTTG AAGAGCGTGA CGAGCCTGTG
CAACTGGTGC AGAAATTTCT CGGCGAACTG TGGACGACGC AACTGCAACA CTCGCCATTC
GATGTGGTGG GCTGGCATGG CAATAACGTG CCGTACACGT ACGATCTGCG GCGTTTTAAT
ACGATTGGCA CGGTCAGCTA CGATCATCCC GACCCGTCGA TTTTCACCGT ACTGACCTCG
CCAGGCGCCG TCCACGGTCA GGCCAACATC GATTTCGTGA TCTTCCCGCC ACGCTGGATG
GTCGCTGAAA ACACCTTCAG GCCGCCGTGG TTCCATCGCA ACCTGATGAA CGAATTCATG
GGTCTGATCG ACGGCGCCTA CGACGCCAAG GCTGAGGGTT TCATGCCGGG TGGCTCGTCG
CTGCACAACT GCATGAGCGC CCACGGCCCG GACAATATCA CCGCGGAAAA AGCCATTGCT
GCGGACCTGA AACCGCACAA GATCGAAAAC ACCATGGCGT TCATGTTCGA GACCGGCAAA
GTGCTGCGCC CCAGCCTGCA TGCGCTGGCC TGCCCGCAGT TGCAGGCCGA TTACGACGCC
TGCTGGAACG GCATGGCCAG AACTTTCAAC AAGGAATCAT CCTGA
 
Protein sequence
MAIHSSFEAL AYQSGFANQF SSEALPGALP MGQNSPQKHP LGLYAEQFSG TAFTVARNEA 
RRTWLYRIKP SAAHPRYQRM DRQITGREQG PINPNRLRWN AFDIPAEPVD FIDGLIALAS
TSAADQAEGV SVYVYAANTS MQRAFFSADG EWLIVPQQGR LRIITEMGVL DIGPLEIAVL
PRGLKFSVQL LDGSARGYLC ENHGGVLRLP ELGPIGSNGL ANPRDFLTPV AWFEERDEPV
QLVQKFLGEL WTTQLQHSPF DVVGWHGNNV PYTYDLRRFN TIGTVSYDHP DPSIFTVLTS
PGAVHGQANI DFVIFPPRWM VAENTFRPPW FHRNLMNEFM GLIDGAYDAK AEGFMPGGSS
LHNCMSAHGP DNITAEKAIA ADLKPHKIEN TMAFMFETGK VLRPSLHALA CPQLQADYDA
CWNGMARTFN KESS