Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Arth_1700 |
Symbol | |
ID | 4445773 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | + |
Start bp | 1898523 |
End bp | 1899962 |
Gene Length | 1440 bp |
Protein Length | 479 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 639689522 |
Product | sulfatase |
Protein accession | YP_831194 |
Protein GI | 116670261 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0968821 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCCACC ACCCCGGACC AAATATCCTG CTCATCCTCA GCGACGACCA GGGCGCATGG GCCTTGGGAT GCTCCGGCAA TACCGAAATC CAGACGCCCC ATCTGGACAA CCTGGCCTCA GGCGGAACGC GATTGGATAA CTTCTTCTGC GTCTCACCCG TCTGTTCCCC CGCGCGGGCC AGCCTGATGA CCGGCACCAT CCCCTCCAAA CATGGAGTGC ACGACTATCT TCATGGCGTC GAGACCGGCC CCGAGGCTCC CGACTATCTG CAGGGCCAGC GCCTCTTCAC GGATGACTTG GCAGCTGCGG GCTATTACAT GGGACTATCC GGAAAATGGC ACTTGGGAGC CAACGACCGG GCACGGGAGG GATTCAGTCA CTGGTTCTCG CTGGCCGGCG GCGGCAGCCC CTACGATGCG GCGACCATGT ACAGGAACGG CGTGAAGGAA ACAGTGTACG GCTATCTCAC TGATGCCATT ACAGCTGATT CGACCGGCTT CATGGAACGT GCCGCCGGGC AAGATTCCCC GTTCTTCCTG GCGCTGAATT ACACCGCACC GCATAAGCCC TGGAAGGACC AGCATCCTGC TGAATTTACG GCTCTCTATG ATGATTGTGC GTTTGAGAGC TGCCCCCAGG AGCCCACGCA TCCCTGGACG CCTACAGTCG ACGGGGTACC GATTGGAGGT GAAGCTGACG TCCGGGCGGC TTTGGTGGGC TACTTCGCCG CCGTCAGCGC GATGGACGCG GGGATAGGTC AGGTGCTCCA GAAGCTCGAT GAGCTCGGAT TGAGAGAAGA CACACTCGTG ATCTTCAGTA GCGACAACGG GTTCAACTGC GGGCAACACG GCGTGTGGGG CAAGGGAAAC GGCACGTTCC CCTTGAATGT CTTCGATTCC TCGATCAAAG TGCCCGCCAT ATTTTCTTTC CCCGGCAGAA TTGCCCGCGG GAAAGTGCGG GAAGAACTGC TGTCGGCATA CGACCTTCCG GCCACCATCC TGGAGCTGGC CGGGCTTGAC CCGCTCGAAT TCGAACAAGG GCCCGGGAAA TCCTTCGCCG ATGTTCTCCG CGGAAAGCCC CTCGCACCCG CCAGGCCCCG GCCGGTAGTT GTCTTCGATG AATACGGTCC GGTACGCATG ATCCGCAGCG ACTCATGGAA ATACGTCCAC CGCTACCCCC AGGGTCCCCA TGAACTGTAC GACCTGGCCA CCGACCCGGG GGAGCGGCAT AACCTGGTTC GGGAGGTCCG GCATGAAGAG CGCGTCGCCG GAATGCGCCG GGATATGCAG CTCTGGTTTG AGCAGTACCA GGAAGAGGAG GCAGACGGCC GCAAATTCCC CGTCGTCGGG GCAGGACAGA CGCTCCCGGT CCGGGCCGAT CCCCTAGGCG CATTCACGCC GCCAAGCTGG GACGGGATCT CTACTGCCGG AGGCCGATGA
|
Protein sequence | MSHHPGPNIL LILSDDQGAW ALGCSGNTEI QTPHLDNLAS GGTRLDNFFC VSPVCSPARA SLMTGTIPSK HGVHDYLHGV ETGPEAPDYL QGQRLFTDDL AAAGYYMGLS GKWHLGANDR AREGFSHWFS LAGGGSPYDA ATMYRNGVKE TVYGYLTDAI TADSTGFMER AAGQDSPFFL ALNYTAPHKP WKDQHPAEFT ALYDDCAFES CPQEPTHPWT PTVDGVPIGG EADVRAALVG YFAAVSAMDA GIGQVLQKLD ELGLREDTLV IFSSDNGFNC GQHGVWGKGN GTFPLNVFDS SIKVPAIFSF PGRIARGKVR EELLSAYDLP ATILELAGLD PLEFEQGPGK SFADVLRGKP LAPARPRPVV VFDEYGPVRM IRSDSWKYVH RYPQGPHELY DLATDPGERH NLVREVRHEE RVAGMRRDMQ LWFEQYQEEE ADGRKFPVVG AGQTLPVRAD PLGAFTPPSW DGISTAGGR
|
| |