Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Arth_1708 |
Symbol | |
ID | 4445747 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | + |
Start bp | 1908516 |
End bp | 1909865 |
Gene Length | 1350 bp |
Protein Length | 449 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 639689530 |
Product | sulfatase |
Protein accession | YP_831202 |
Protein GI | 116670269 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGCATC CCGCCCCGGC TCAGCCGAAC ATCCTGATCA TCATGGCCGA CCAGTGGGCG GCGCACGCCA TGGGGTGCGC CGGATCGACT GTGGTGAACA CACCCAACCT GGACAATCTC GCCGCGGCCG GCACCCGCTT CGACCGCGCC TACACGACTT TCCCGCTGTG CGTACCTGCA CGCAGTTCGC TGGTCTCCGG GCGCTATCCC CACGAACTAG GAATCGACGG TAATGCCGTA CCAGCAGGCA GCGGGCCGGG GCGGACGCCG GGGAGTCTGG GTCACTGGTT CAAGGCGGCC GGCTACGACT GCGCCTACGC CGGCAAATGG CACGCCCCGG AGGCAAGCGC CCAGCCCGAG GACGGCTTCG ACGTCATCCA TCCCTTCGGC GATGAGGGGC TAACGGCCTC GGCGATCGAC TGGCTCGGCG CCCGCCATGA CACCGGCACG CCTTTCCTGC TGTTGGTCTC CTTCGATAAC CCCCATACCA TCTGCGAATA TGCCCGAGGC CAGCATCTGC CGTACGGGGA CGTCCAGCGG CCAGCAGACA TCCGAGACGC GCCTCCGCTG CCCTCGAATT TCGCCACAAC GCCCTATAGT CCCCAGGCGT TGACTCACGA ACGGGCCCAG GCCGAACAGG CTTACGGGAC GGCGGACTTC AGCCACGATG ACTGGCGGCT TTACAGGCAC GCATACGCGC AGCTCATCGA AAGGACTGAC GAACAGATCG GAGTCATCCT GGGTGAACTT GACCGTCAAG GCCTGAGGGA GACTACCGTA GTGCTCTTCA CCAGCGATCA TGGCGACGGA GACGCCGCCC ATGGCTGGAA CCAGAAGACC TCGTTACAGG AAGAAGCCAT ACGGGTTCCG CTGCTGATGA GGGGCCCCGG TGTCGGCTAC AGCCAGGTAG GCAGCCAGTT AATCTCCCTC GGCCTGGACC TCATTCCGAC GCTCTGCAGC CTGGCAGGCA TTGATGCCCC TGCCACCGCC ACCGGGGTGG ACTGGATCAC CGAACCGCGC GCGCCCGGGG AAGGGATTAC CGTCGAAACG GCTTTCAGCG CAGGACAGCG GGCCACCACT CTGGGGCGCG CCTTAATCAC TGGACGGTAC AAATACACCG TCTACAGCTG GGGTAAACAC CGGGAACAGC TGGTGGACCT CACGGCCGAT CCCGGCGAGC TCCGTAATCT CGCGGAAGAG TCCGCTTTCG ATGAGGTCCT GGAGGAATTC CGGCGACGGC TTCTGGATTG GTGTTGGGAA ACCGGCGATC AGGCGTTTCT GAAGAAACTC GTCCTGCCCC ATTCCGGGAG CAGCCTGGCC CGCAAGGAAA TCTACGCCGT GCCTTACTAG
|
Protein sequence | MSHPAPAQPN ILIIMADQWA AHAMGCAGST VVNTPNLDNL AAAGTRFDRA YTTFPLCVPA RSSLVSGRYP HELGIDGNAV PAGSGPGRTP GSLGHWFKAA GYDCAYAGKW HAPEASAQPE DGFDVIHPFG DEGLTASAID WLGARHDTGT PFLLLVSFDN PHTICEYARG QHLPYGDVQR PADIRDAPPL PSNFATTPYS PQALTHERAQ AEQAYGTADF SHDDWRLYRH AYAQLIERTD EQIGVILGEL DRQGLRETTV VLFTSDHGDG DAAHGWNQKT SLQEEAIRVP LLMRGPGVGY SQVGSQLISL GLDLIPTLCS LAGIDAPATA TGVDWITEPR APGEGITVET AFSAGQRATT LGRALITGRY KYTVYSWGKH REQLVDLTAD PGELRNLAEE SAFDEVLEEF RRRLLDWCWE TGDQAFLKKL VLPHSGSSLA RKEIYAVPY
|
| |