Gene Information |
Locus tag | Arth_4373 |
Symbol | pcmA |
ID | 4443453 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008538 |
Strand | - |
Start bp | 111456 |
End bp | 112757 |
Gene Length | 1302 bp |
Protein Length | 433 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 639687694 |
Product | protocatechuate 4,5-dioxygenase |
Protein accession | YP_829391 |
Protein GI | 116662337 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR02792] protocatechuate 4,5-dioxygenase, alpha subunit |
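The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon; the record states neither convention, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count, assuming the last codon of the CDS is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(111456, 112757)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints "1302 433"
```

Under these assumptions the computed values agree with the Gene Length (1302 bp) and Protein Length (433 aa) fields.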
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACACTCG ACAAACCATA CAAAGACGTC CCTGGCACCA CCATCTTTGA TGCCGACCAG GCCCGAAAGG GCTACAACCT AAACCAGTTC TGCATGTCGC TGATGAAACC GGAGAACCGC GAACGGTATC TGGCCGACCG CGGTGCGTAC CTGGACGAGT GGCCTCTCAA CCCGGTGCAG CGCCAGGCGG TGCTCGACAT CGACCTCAAC ACCTGCATCG CCGAGGGCGG GAACATCTAC TTCCTGGCCA AGATCGGCGC CACTCACGGC CTGAGCTTTC AGCAGATGGC CGGCTCGATG ACGGGCATGT CTGAGGCCGC GTACCGAGAC ATGATGATCG GCGGTGGGCG CCGCCCGGAG GGAAACCGCC TCAAGGACCT CGACGGATGG ACGCCTCCTG AGCCCGGCGA GAAGGCGGAG ACGGTGCGGC AGGATGCTCC GGCGCAGTAC ACCTCGGCAC TATTCACCTC GCACGTGCCG GCGATCGGCG CAGCGATGGA CCTCGGTAAG ACCGAGGAGC CGTACTGGAA GAAGGTGTTC TCCGGGTATG AGTGGACGCG AGAGTGGGCC AAGGAGAACC TGCCCGACGT CGTCATCCTG GTGTACAACG ACCACGCCAC AGCCTTCGAC TCCTCGATCA TCCCGACCTT CGTACTCGGC ACCGGCGCGG AGTACCCGGT CGCGGACGAG GGCTATGGCC CCCGCCCTGT GCCCGACGTC AAAGGCTATC CCGAGCTGGC CGCACACATC GCCCAGTCCG TGATCCAGGA TGACTTCGAC CTCACGCTCG TCAACGAGAT GGTCGTGGAT CACGGCCTCA CGGTGCCGCT GTCGCTTGTG TATGGCGATG TCGAGGAATG GCCCGTCAGG GTCATCCCCC TCGCCGTGAA CGTTGTGCAG TATCCGGTGC CGTCCGGACG TCGCTGCTAC GAACTCGGGC GTGCGCTCCG CCGCGCGCTG GACAAGTGGG ACGGTGAGCC GCTCAACGTT CAAATTTGGG GAACCGGCGG CATGAGCCAC CAGCTGCAGG GCCCCCGCGC TGGCCTCATT AACGAGGAGT GGGACAACGC ATTCCTGGAC CACCTCATCG CCGACCCCGT GGGCCTGACA GAATGGCAGC ACATGGAGTA CGTCGACGAG GCCGGTTCCG AGGGCATCGA GCTAGTCGAC TGGCTCATCG CGCGCGGTGC GATGGATGAT CAGTTCGGAG GCGAGAGCCC CGAGGTGAAT CACCGCTTTT ACCACGTGCC CGCGTCGAAC ACCGCCGTCG GCCACCTTGT TCTCACGAAC CCGACCGACT GA
|
Protein sequence | MTLDKPYKDV PGTTIFDADQ ARKGYNLNQF CMSLMKPENR ERYLADRGAY LDEWPLNPVQ RQAVLDIDLN TCIAEGGNIY FLAKIGATHG LSFQQMAGSM TGMSEAAYRD MMIGGGRRPE GNRLKDLDGW TPPEPGEKAE TVRQDAPAQY TSALFTSHVP AIGAAMDLGK TEEPYWKKVF SGYEWTREWA KENLPDVVIL VYNDHATAFD SSIIPTFVLG TGAEYPVADE GYGPRPVPDV KGYPELAAHI AQSVIQDDFD LTLVNEMVVD HGLTVPLSLV YGDVEEWPVR VIPLAVNVVQ YPVPSGRRCY ELGRALRRAL DKWDGEPLNV QIWGTGGMSH QLQGPRAGLI NEEWDNAFLD HLIADPVGLT EWQHMEYVDE AGSEGIELVD WLIARGAMDD QFGGESPEVN HRFYHVPASN TAVGHLVLTN PTD
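The relationship between the two sequence fields can be illustrated with a small translation routine. This is a minimal sketch, not the method used to produce this record: it assumes the spaces in the sequences are display formatting only, and it relies on the fact that translation table 11 uses the same amino-acid assignments as the standard genetic code for elongation codons. The helper names (`clean`, `translate`, `gc_percent`) are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons in the order TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove whitespace (assumed to be display formatting) and uppercase."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from the first base, stopping at a stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base groups of the gene sequence shown above.
first_30 = "ATGACACTCG ACAAACCATA CAAAGACGTC"
print(translate(first_30))  # prints "MTLDKPYKDV"
```

Applied to the full gene sequence, the output of `translate` and `gc_percent` can be compared against the Protein sequence and GC content fields of this record.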