Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PSPTO_3551 |
Symbol | hmgA |
ID | 1185216 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pseudomonas syringae pv. tomato str. DC3000 |
Kingdom | Bacteria |
Replicon accession | NC_004578 |
Strand | - |
Start bp | 4007204 |
End bp | 4008508 |
Gene Length | 1305 bp |
Protein Length | 434 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637394904 |
Product | homogentisate 1,2-dioxygenase |
Protein accession | NP_793331 |
Protein GI | 28870712 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3508] Homogentisate 1,2-dioxygenase |
TIGRFAM ID | [TIGR01015] homogentisate 1,2-dioxygenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.518718 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTATTC ATTCCAGCTT CGAAGCGTTG GCCTATCAAT CGGGCTTCGC CAACCAGTTC AGCAGCGAGG CGCTGCCTGG CGCCTTGCCC ATGGGCCAGA ACTCGCCGCA GAAACACCCC CTGGGCCTGT ATGCCGAGCA GTTCTCAGGG ACCGCTTTCA CCGTCGCCCG CAACGAGGCC CGACGGACCT GGTTGTACCG CATCAAACCG TCGGCGGCAC ATCCGCGCTA TCAACGCATG GACCGGCAAA TCACTGGCCG TGAACAAGGC CCGATCAACC CCAATCGACT GCGCTGGAAC GCGTTCGATA TCCCGGCCGA GCCCGTCGAT TTCATTGACG GGCTGATTGC GCTGGCCAGC ACTTCAGCGG CAGATCAGGC CGAGGGCGTC AGCGTTTATG TGTACGCAGC CAATACGTCG ATGCAGCGCG CGTTTTTCAG CGCTGACGGG GAATGGTTGA TCGTGCCGCA ACAGGGGCGA CTCAGAATCA TCACCGAGAT GGGTGTGCTG GACATCGGCC CGCTGGAAAT CGCCGTGCTG CCACGCGGCC TGAAATTCAG CGTGCAACTG CTCGACGGCA GCGCTCGGGG TTACCTCTGC GAGAACCACG GTGGTGTCTT GCGCCTGCCG GAACTGGGGC CGATCGGTAG CAATGGTCTG GCCAACCCGC GTGATTTTCT GACGCCCGTG GCCTGGTTTG AAGAGCGTGA CGAGCCTGTG CAACTGGTGC AGAAATTTCT CGGCGAACTG TGGACGACGC AACTGCAACA CTCGCCATTC GATGTGGTGG GCTGGCATGG CAATAACGTG CCGTACACGT ACGATCTGCG GCGTTTTAAT ACGATTGGCA CGGTCAGCTA CGATCATCCC GACCCGTCGA TTTTCACCGT ACTGACCTCG CCAGGCGCCG TCCACGGTCA GGCCAACATC GATTTCGTGA TCTTCCCGCC ACGCTGGATG GTCGCTGAAA ACACCTTCAG GCCGCCGTGG TTCCATCGCA ACCTGATGAA CGAATTCATG GGTCTGATCG ACGGCGCCTA CGACGCCAAG GCTGAGGGTT TCATGCCGGG TGGCTCGTCG CTGCACAACT GCATGAGCGC CCACGGCCCG GACAATATCA CCGCGGAAAA AGCCATTGCT GCGGACCTGA AACCGCACAA GATCGAAAAC ACCATGGCGT TCATGTTCGA GACCGGCAAA GTGCTGCGCC CCAGCCTGCA TGCGCTGGCC TGCCCGCAGT TGCAGGCCGA TTACGACGCC TGCTGGAACG GCATGGCCAG AACTTTCAAC AAGGAATCAT CCTGA
|
Protein sequence | MAIHSSFEAL AYQSGFANQF SSEALPGALP MGQNSPQKHP LGLYAEQFSG TAFTVARNEA RRTWLYRIKP SAAHPRYQRM DRQITGREQG PINPNRLRWN AFDIPAEPVD FIDGLIALAS TSAADQAEGV SVYVYAANTS MQRAFFSADG EWLIVPQQGR LRIITEMGVL DIGPLEIAVL PRGLKFSVQL LDGSARGYLC ENHGGVLRLP ELGPIGSNGL ANPRDFLTPV AWFEERDEPV QLVQKFLGEL WTTQLQHSPF DVVGWHGNNV PYTYDLRRFN TIGTVSYDHP DPSIFTVLTS PGAVHGQANI DFVIFPPRWM VAENTFRPPW FHRNLMNEFM GLIDGAYDAK AEGFMPGGSS LHNCMSAHGP DNITAEKAIA ADLKPHKIEN TMAFMFETGK VLRPSLHALA CPQLQADYDA CWNGMARTFN KESS
|
| |