Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Arth_2052 |
Symbol | |
ID | 4445426 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | - |
Start bp | 2313055 |
End bp | 2314188 |
Gene Length | 1134 bp |
Protein Length | 377 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 639689860 |
Product | cupin 2 domain-containing protein |
Protein accession | YP_831532 |
Protein GI | 116670599 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3435] Gentisate 1,2-dioxygenase |
TIGRFAM ID | [TIGR02272] gentisate 1,2-dioxygenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.748893 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGTCCATCA GCGCCGAGAA CACGACTCAT GAATCAGTGG CCGCGGGGCA CACTGCTCCG GAGCCGACGC CTGAAGAAGC TGCCCAGCTC GAGGAGCTGT ACCGGGATTT TGACCGGGAG AACCTGATCC CGTTGTGGAC TGAGATCGGT GACCTGATGC CGATGGTCCC CTCCCCGAAG GCGGTGCCGC ATGTGTGGCG GTGGAGCGAC CTGTACCCGC TGGCCGCCCG CGCAGGTGAC CTGGTGCCGG TGGGCCGCGG CGGGGAACGC CGCGCCATTG CCCTCGCCAA CCCTGGTTTG GGCGGCACGG CCTATGCCAC GCCCACTCTG TGGGCAGCCA TCCAGTACCT GGGCGGCCAC GAAACAGCCC CCGAGCACCG CCATTCCCAA AACGCGTTCC GCTTCGTCGT CGAAGGTGAA GGCGTGTGGA CCGTGGTGAA CGGGGACCCG GTCCGGATGT CCCGCGGTGA TTTCCTGCTG ACCCCGGGCT GGAACTTCCA CGGCCACCAC AACGACACCG ATGAGCCGAT GGCCTGGATC GACGGCCTGG ACATCCCGTT CGTGCACTAC GCGGACGCCG GGTTCTTCGA GTTCGGCACC GAACGGGTCA CCGACGAGGC CACCCCGGAC ATCTCCCGCT CCGAGCGGCT CTGGGCCCAC CCGGGCCTGC GCCCGCTCTC CGGCCTGGAT GACACCACCA GCTCCCCCAT CGCCGCGTAC CGGTGGGAAT ACACTGACCG TGCCCTGGCC GAGCAACTTT TGCTCGAGGA CGAGGGCCAC CCGGCCACCG TGTCCCAGGG CCACGCCGCT GTCCGTTACA CCAATCCCAC CACCGGCGGG GACGTGATGC CCACCATCCG GGCCGAATTC CACCGCCTCC GGCCCGGCGC GTCCACCCAG GGCGTCCGCG AGGTCGGCTC CAGCGTCTGG CAGGTCTTCG AAGGGACCGG TGCCGTTGTT CTCAACGGCG AACCCCGGAC CCTGGAAAAG GGCGACCTCT TCGTTGTCCC GTCCTGGGCT GAATGGTCCC TGCAGGCTGA GAGCGGGTTT GATCTGTTCC GGTTCAGCGA CGCCCCCATT TTTGAACGAC TGAACTTCAA CCGCACCTAC ATCGAAGGAC GCAAGAACGC ATGA
|
Protein sequence | MSISAENTTH ESVAAGHTAP EPTPEEAAQL EELYRDFDRE NLIPLWTEIG DLMPMVPSPK AVPHVWRWSD LYPLAARAGD LVPVGRGGER RAIALANPGL GGTAYATPTL WAAIQYLGGH ETAPEHRHSQ NAFRFVVEGE GVWTVVNGDP VRMSRGDFLL TPGWNFHGHH NDTDEPMAWI DGLDIPFVHY ADAGFFEFGT ERVTDEATPD ISRSERLWAH PGLRPLSGLD DTTSSPIAAY RWEYTDRALA EQLLLEDEGH PATVSQGHAA VRYTNPTTGG DVMPTIRAEF HRLRPGASTQ GVREVGSSVW QVFEGTGAVV LNGEPRTLEK GDLFVVPSWA EWSLQAESGF DLFRFSDAPI FERLNFNRTY IEGRKNA
|
| |