Gene Information |
Locus tag | Arth_1701 |
Symbol | |
ID | 4445774 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | + |
Start bp | 1899962 |
End bp | 1901389 |
Gene Length | 1428 bp |
Protein Length | 475 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 639689523 |
Product | sulfatase |
Protein accession | YP_831195 |
Protein GI | 116670262 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
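The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length; both are assumptions about how this record is encoded, not statements from the record itself. The function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 1899962
END_BP = 1901389


def gene_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1428
print(protein_length(length_bp))  # 475
```

Under these assumptions the results agree with the listed Gene Length (1428 bp) and Protein Length (475 aa).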
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.987903 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTGCTG AACGCCCTAA TATCCTGTTC ATCATCGCCG ATCAGTTCCG CAACAGTGCT TTGGGATTCA GGGGCCAGGA TCCCACCTAC ACCCCATCCT TGAACTCCTT CGCAGCCGAA TCAAAGGACA TCCTCCACGC CGTCAGTAAC TATCCGGTCT GCAGTCCGCA CCGGGCAATG CTCATGACAG GGCAACACCC CCACCGTAAC GGCGTGCCTC TGAACATCAA TTCCAACACC GGTGCCGGTC TCGAACCCGG CATAGGCACA TGGTCACAGG TACTCCGGGA TGCGGGCTAC GGGACCGGAT ACATCGGCAA ATGGCATCTG GAAGCAGTCA CCGAAGAGGA CGCAATCTGG GGCGAAGGAT TTCGCGAAGG AGCTGTCTGG GATGCCTACT CGCCTGTGGA CAGGCGGCAC GGTTTCTCCT TCTGGTACTC GTACGGAGCC GCCCACGACC ACATGCACCC CCACTATTGG GTAGGGGACG CACCCCGCGA AGAGAAAATC GTCGTGGATC AATGGTCGGC AGAACACGAA ACCGACATTG CCATCGGCTT TTTGCGCGAA ACAACAGACG CTGCCGAGTC CTTCGCGCTG GTGGTGTCCT ACAACCCGCC GCACCAACCG TTCGAGCTGG CACCTGCAAC CTACCGCCCC AGGTACGCTC AACTCTCCGC ACGGGAACTA TTGACCAGAC CCAACGTCGA CGTCACGGGC CCGGCCGGAG CGGAAGCAGC GCAGGCAGCC CCGTTGTATT TCGCCGCCAT TTCCGCCATC GATCACCAAA TCGGCCGGCT GCTCGTCGCC TTGGAAGCCA GCGGTCACCA CAAAAACACG ATCGTGATCT TCACCTCGGA CCATGGCATG CAGCTGGGAA GCCACGGTCT TATGTTCAAA AATGTTCCTT GGGAAGAATC GATGTCGTTA CCATTTCTCA TCCGTTGGCC GGGCCGGATA GCGTCCGGTC CCGACGACAA GGTTTTGATC AGCTCCGTTG ATGTCGGTCC CACCCTCCTG GGCCTGGCGG GGCTATCTAC CAGCCGGCCG CCGGCCATGC AGGGCGCTGA TCTCTCGTCC CGGCTGATCG GCGCGACAAC GACGCCGGTC CCTGGTCCCG CCATCTACTA CGGGCCCCCG GCGCGGGACG GCGGGCCGGG GATGCGAGGC CTGAGGACCC TGAGTCACAA GTTGCTCTTC AGCTGTATCC CCGATCCGGC CCAACGATCC GGGTTCCTTC TCTCCGCGCA GCTCTACGAC CTGAAGTCCG ACCCCTACGA AATGAGTGAC CAGGCTGCCT CCCGCCCTGC TGAGGTTCAG CTGATGGGGC GTGAACTGGT ACGGCAGCTG GAGATCGTGG AGGATCCCTG GGCGGCAAGG GATCAGCTCC GGGAACACTT GGACGGGGAA ACGTGCAGCT GGAAATAG
|
Protein sequence | MTAERPNILF IIADQFRNSA LGFRGQDPTY TPSLNSFAAE SKDILHAVSN YPVCSPHRAM LMTGQHPHRN GVPLNINSNT GAGLEPGIGT WSQVLRDAGY GTGYIGKWHL EAVTEEDAIW GEGFREGAVW DAYSPVDRRH GFSFWYSYGA AHDHMHPHYW VGDAPREEKI VVDQWSAEHE TDIAIGFLRE TTDAAESFAL VVSYNPPHQP FELAPATYRP RYAQLSAREL LTRPNVDVTG PAGAEAAQAA PLYFAAISAI DHQIGRLLVA LEASGHHKNT IVIFTSDHGM QLGSHGLMFK NVPWEESMSL PFLIRWPGRI ASGPDDKVLI SSVDVGPTLL GLAGLSTSRP PAMQGADLSS RLIGATTTPV PGPAIYYGPP ARDGGPGMRG LRTLSHKLLF SCIPDPAQRS GFLLSAQLYD LKSDPYEMSD QAASRPAEVQ LMGRELVRQL EIVEDPWAAR DQLREHLDGE TCSWK
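The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes that the amino acid assignments of translation table 11 match the standard genetic code (the tables differ in permitted start codons, which this sketch does not model), and it strips the 10-base grouping spaces used in this record. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical. Only the first 30 and last 18 bases of the gene are used, and the listed GC content of 61% is not recomputed here.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace used for 10-base grouping and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a non-empty sequence."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


# First 30 and last 18 bases of the gene sequence in this record.
print(translate("ATGACTGCTG AACGCCCTAA TATCCTGTTC"))  # MTAERPNILF
print(translate("ACGTGCAGCT GGAAATAG"))               # TCSWK
```

Both outputs match the start (MTAERPNILF) and end (TCSWK) of the listed protein sequence, with the final TAG acting as the stop codon.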