Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Arth_0543 |
Symbol | |
ID | 4446993 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | + |
Start bp | 576730 |
End bp | 579777 |
Gene Length | 3048 bp |
Protein Length | 1015 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 639688340 |
Product | exo-alpha-sialidase |
Protein accession | YP_830042 |
Protein GI | 116669109 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG4409] Neuraminidase (sialidase) |
TIGRFAM ID | [TIGR01451] conserved repeat domain |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTGGGACA AGTCGCTATG CCCGACACGT TTTCGCACTG CTGCCCTAGG GCGGCTACCT GAAAGGACAG TGGATTTGAA ACTCAAAAAG AGGGAGCCGG CCGGCAGGGC CGGCTCCGTG GGCCGGGTGG CCGCCGCAGG GCTCCTTGGG ATGGCGCTGA TCGCGGGACC CGGGTTGCCG GCCAGGGCCG AGCCGGCTCC GCCCTCCAAT CCGGCAGCCG CCCCGGGCAC CTTCGCGGAA GCGAACATTG CCGCGGACCG GACGGCCGCC AATTTCTTTT ACCGTATTCC CGCGCTCACC TACCTCGGGA ACGACGTTGT ACTTGCAGCG TGGGACGGCA GGCCCGGGAG CTCGGCCGAC GCGCCGAACC CGAACTCGAT CGTGCAGCGC CGCAGTATCG ACGGCGGCGC AACGTGGGGT CCTTTGACCG TCATCGCTGC CGGCCATGTG GCTGATGCCA GCGGCCCCAA ATACGGGTTC AGTGATCCGT CGTACATCTA CGACGCTGAG GCGAACAAAG TGTTCGCCCT GTTCGTTTAC TCAAAGGATG CCGGCTTCTC TGCCAGCACC TACGGCAACG ACGACGCCGA CAGGAATGTC ATTTCCTCGG CCGTGGTGGA GTCCGCCGAC GAAGGCCGCA CCTGGAGCCA GCCCCGGTTC ATCACAAGCG TCACGAAACC CGGAAGCAGT AAGACCAACC CGCAGCCGGG TGACGTACGC ACCAACTTCG CGGCATCCGG TGAGGGGATC CAGCTCAAGT ACGGCGCTCA CAAAGGCCGG TTGATTCAGC AGTACTCGGG TTACGTGCGT CAAGCCAACG GTTCGGAACT CTTCCAGGCC TACAGCGTCT ATTCAGATGA CCATGGCGCA ACGTGGCACA AAGGGGCCCC GATCGGCGAC CGCATGGACG AGAACAAGAC CGTGGAACTC TCCGACGGCA GGGTGCTGCT GAATTCGAGG GACAGCGGGA ACGGCGGCTA TCGCAAAGTG GCCGTGTCCA CCGACGGCGG CGCCAGCTAC GGGCCGGTTA CGCAGGACAC CGAACTGCCG GACCCTGCCA ACAACGGGTC AATCTCCCGG ATGTACCCGG CCGCACCGGA GGGCTCAGCC GAGGCAAGGA AGCTGATCTT CACCAATTCC AACTCCAAGG CCGCCAGGGA AAACGTCTCG GCGCGGGTGT CCTGTGACGA CGGAGCAACG TGGCCCGGTG TCCGCACCAT CCGTCCCGGC TTCTCCGCGT ATTCAACCAT TACCCGCCTG GCCGAGGGCA AGTTCGGCGT CCTGTACGAG GCGAACTACA CGGACAACAT ACAGTTCGCC AGTTTCGACG ACGCCTGGCT GAACTATGTC TGCGCTCCCG TGAACGTGCC CGCACAAACC ATTGCGCCCG GTGTTGCGCA GCAGGTTCCG GTGACAGTTA CCAACCAGGA AGCCCACGTC CTGTCGGGCG CCCGGGCCAG TATCTATACG CCAGCGGGAT GGTCCGCCGC CACTGTGGAC GTTCCTGACC TTGCAACGGG TAGCTCGGCC ACGGTGAACG TCCAGCTCAC ACCGCCGGCC GGAGCTTCGG GTCCAACTTC CCTCAATGCG GCTTTCACCA CTGCCGACGG AAGAGTGTCC CAGTACACGT TCGTTGCCAA CAGTCCGGTA GCTCCCCAGG TTGGCCTGAC CATCGCAGGC TCAGCGCCGG CACGGGACGT GGCGGCGAAC CCGTACAAGG AAGGCGAGGT GCTGTCTTAC ACCTTCGCGG TCAAGAGCAC GTCGAACGTC ACGTCCAATG CCGTCCCCCT TTCCGGGACC TTCGAGACCG GGTTCCTGCC GCCGTCGGCC CCTAACTGCC GGTACAACAA CCTTGCCGCC GGTGCCAGCT ACAACTGCAC GACGCCTAAG CACACGCTTA CTCCTGAAGA CATAGCGCGC GGCTACCTCG TCCCTGTGGC TGAGTTCACC GTCACGGCCT CCGGCAATAC GGCACTGACG AAGGCAGTGT CCTTCAAAGG AGCAGCCGTA CCGTTGCGGG ATGGCCTGCT GGCCGGATCG ATCAGCGGTG CCCGGAATGA TGCCGGACGT GACCTCGCCG TGCAGCCGTA TGCAGCCGGC GAGCAGGTGC CCTACACGTT TACCGTCAGC AACACCGGCC CCCTGGCCGC GGACGTTGTG CCGATTGCCG GCAATTTCTC ACCCCTCGTA CCCCCGGGCG CGGGAAACTG CCGGTGGCTC AACCTTGCCG CGGGAGGATC CTACGCATGC TCCACACCGC GGCACACCGT GACCCAAAAA GAGGCGGAGG AGGGATTCTT CCGTGCCGAC TCCACTTGGA CAGTTGCTGC GTCCGGGCAG AGCAGCCGGG AATACCGTGT GGACGGCGGC GAAGTGGACC TCGCGATCCG GAACCCGAAG CTGGACGGCA CGATCTCGGC TGAATGGGCC GATGCCGACG GCGACCGCTA CGCGAGTGCC GGGGATTCCG TCACCTACAC CTACGGCGTG GGAAATGCCG GCAATGTCGC GCTGACCGGC GTCACGGCTA CGGATGCCGG CATTTCAGTG GACAGGCTGG GCATCGGGGA GACAGCAACG GCAACCAGGG TGCACATCCT GACTCCCGCA GATATCGCGG CCGGCCAGTT GCCGGCCTCT CCGTTTGCCG CCTCTGCATC CAACGGGTCG CGGAACGTGC GCGTTGACGT GCAGGCCGGA GCGGTGGCCC TGCGGCTTCA GCCAGCCAAA CCGGCGGCCG TTCCGGTGTT GACGGTCCAG GATTTCGACG GGCAGGTTCC GCCCGTCGAC CTGGACACCA ATGAAAAATA CCGTAACGGC GAGAAGGTGA CGCTCCGCGG CCTTCCCCAC GGCCAGTGGT ATTACGTCTA CCTGAACAAG CACGGCTTCC GCCTCGGCTG GATCTTTCCC ACCACGGCGG ACACGGTGGA GTTCCTCCTG CCCTCCACTG TGCAGAACGG GCGGGACGAC GTGGTGGTCC TGGATTCCGA AGGGAAGCAG GTTTCCTTTG ACCGACTTCA GGTCACACCG AAAGGGTCCA TCGGCTGA
|
Protein sequence | MWDKSLCPTR FRTAALGRLP ERTVDLKLKK REPAGRAGSV GRVAAAGLLG MALIAGPGLP ARAEPAPPSN PAAAPGTFAE ANIAADRTAA NFFYRIPALT YLGNDVVLAA WDGRPGSSAD APNPNSIVQR RSIDGGATWG PLTVIAAGHV ADASGPKYGF SDPSYIYDAE ANKVFALFVY SKDAGFSAST YGNDDADRNV ISSAVVESAD EGRTWSQPRF ITSVTKPGSS KTNPQPGDVR TNFAASGEGI QLKYGAHKGR LIQQYSGYVR QANGSELFQA YSVYSDDHGA TWHKGAPIGD RMDENKTVEL SDGRVLLNSR DSGNGGYRKV AVSTDGGASY GPVTQDTELP DPANNGSISR MYPAAPEGSA EARKLIFTNS NSKAARENVS ARVSCDDGAT WPGVRTIRPG FSAYSTITRL AEGKFGVLYE ANYTDNIQFA SFDDAWLNYV CAPVNVPAQT IAPGVAQQVP VTVTNQEAHV LSGARASIYT PAGWSAATVD VPDLATGSSA TVNVQLTPPA GASGPTSLNA AFTTADGRVS QYTFVANSPV APQVGLTIAG SAPARDVAAN PYKEGEVLSY TFAVKSTSNV TSNAVPLSGT FETGFLPPSA PNCRYNNLAA GASYNCTTPK HTLTPEDIAR GYLVPVAEFT VTASGNTALT KAVSFKGAAV PLRDGLLAGS ISGARNDAGR DLAVQPYAAG EQVPYTFTVS NTGPLAADVV PIAGNFSPLV PPGAGNCRWL NLAAGGSYAC STPRHTVTQK EAEEGFFRAD STWTVAASGQ SSREYRVDGG EVDLAIRNPK LDGTISAEWA DADGDRYASA GDSVTYTYGV GNAGNVALTG VTATDAGISV DRLGIGETAT ATRVHILTPA DIAAGQLPAS PFAASASNGS RNVRVDVQAG AVALRLQPAK PAAVPVLTVQ DFDGQVPPVD LDTNEKYRNG EKVTLRGLPH GQWYYVYLNK HGFRLGWIFP TTADTVEFLL PSTVQNGRDD VVVLDSEGKQ VSFDRLQVTP KGSIG
|
| |