Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Arth_1716 |
Symbol | |
ID | 4445755 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | + |
Start bp | 1917550 |
End bp | 1919076 |
Gene Length | 1527 bp |
Protein Length | 508 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 639689538 |
Product | sulfatase |
Protein accession | YP_831210 |
Protein GI | 116670277 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAGTAG AAGGCGCACC AAGGACAAAC ATTCTGTTTC TCATGACAGA CCAACAACGC ATCGATACAA TGGGCTGCTA CGGAAATAGG TCCCGTCACA CCCCCTACCT TGACGGGCTG GCAGCCCGGG GCACTGTGTA CGACCGCGCT TACACTCCCA CGGCCATCTG CACGCCCGCC CGCGCATCCC TCCTGACAGG GCTTCATCCC TTCGAGCACG GGCTGCTGTC AAATTTCGAG TGGAACTCCG GTCACCGGGA CGAACTGCCC GACGGTACTC CCACTTTTGC CGACGAACTC AGGAAGCAGG GATACCGGTT GGGGCACGTC GGCAAATGGC ACGTCGGGCG GGAGCGCGGT CCGGATTTCT ACGGCTTTGA AGGGGAGCAC CTGCCCGGGG CCCTGAACAC CTTCGATAAC CCGGCATACA CGTCCTGGCT TGCGGAGAAA GGGTTCCCCT CATTCCGCAT AGTGGACCCG GTGTACACCG TTCAAAAAGA CGGATCGCAG GGGCACCTCA TCGCAGGGAT CACTGACCAG CCCACAGAAG CGACGTTCGA AGCCTGGCTG GCGGACCAGA CCATCGCCAA GCTCCGCGAG TTTGCCCAGA CCCACCCGGC TGGAGGCGCC CCAGGCACCG AAACAGCCGT CGCACCCTTC TACCTGTCCT GCCACATCTT CGGACCCCAT TTGCCGTATC TCATTCCGAG GCAATGGTAT GACTTGGTGG ATCCAGCAAC GGTGCAGCTG CCCAAGTCCT TCGCTGAAAC TTTTAACGGC AAACCTCTGG TCCAACAGAC CTACGCCGAA TACTGGTCCA CCGATTCATT CACGGTAGAG GAATGGAAGA AACTGACCGC GGTCTACTGG GGCTACGTTT CCATGATCGA CCACGAGATC GGACGCATCC TCCAGACCGT CGAGGAACTG GGGCTCAACG ATTCGACCGT GATCATGTTC ACCGCGGATC ACGGCGAGTT CACCGGCGCA CACAGGCTCA ACGACAAGGG GCCTGCAATG TACGAGGATA TTTACCGTAT CCCCGCTATT GTCGCTGCGC CCGGCCAGGA ACCCAGACGG GAATCAAAAT TCGTCTCCCT CCAGGACTTC ACCGCCACGT TCATCGACAT CGCCGACGGC TATGCCGGAA ATATTCGCGG GAGTTCATTG ATGCCCTCCA CGACCGCTCC ACTGCCCGCT GACTGGCGAA CAGAGATGGT GTGCGAATTC CACGGACACC ATTTTCCTTA CGCGCAACGG ATGATCCGTA ATGAACGATA CAAGTACATC GCCAACCCGG AAGGGATTGA CGAGTTCTAC GATCTGGTCA GCGACCCCGA CGAACTCCAT AACGTGGTAA CTGTGCCCGC CTACGCGACG CAGCTCAAGA CGATGCGGCT GAGTCTCTAC AAGGAACTCG TCTCCAGAGG TGACAAGTTC TATCAGTGGC TGGCATTCGC AGGGGACATC GAACCCGAAG ATCGACTCAG GCCCGACACC GCCCTCGAAC GCTTCGTAAC CCAATGA
|
Protein sequence | MAVEGAPRTN ILFLMTDQQR IDTMGCYGNR SRHTPYLDGL AARGTVYDRA YTPTAICTPA RASLLTGLHP FEHGLLSNFE WNSGHRDELP DGTPTFADEL RKQGYRLGHV GKWHVGRERG PDFYGFEGEH LPGALNTFDN PAYTSWLAEK GFPSFRIVDP VYTVQKDGSQ GHLIAGITDQ PTEATFEAWL ADQTIAKLRE FAQTHPAGGA PGTETAVAPF YLSCHIFGPH LPYLIPRQWY DLVDPATVQL PKSFAETFNG KPLVQQTYAE YWSTDSFTVE EWKKLTAVYW GYVSMIDHEI GRILQTVEEL GLNDSTVIMF TADHGEFTGA HRLNDKGPAM YEDIYRIPAI VAAPGQEPRR ESKFVSLQDF TATFIDIADG YAGNIRGSSL MPSTTAPLPA DWRTEMVCEF HGHHFPYAQR MIRNERYKYI ANPEGIDEFY DLVSDPDELH NVVTVPAYAT QLKTMRLSLY KELVSRGDKF YQWLAFAGDI EPEDRLRPDT ALERFVTQ
|
| |