Gene Arth_0543 details


Gene Information

Locus tag: Arth_0543
Symbol:
ID: 4446993
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 576730
End bp: 579777
Gene Length: 3048 bp
Protein Length: 1015 aa
Translation table: 11
GC content: 64%
IMG OID: 639688340
Product: exo-alpha-sialidase
Protein accession: YP_830042
Protein GI: 116669109
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG4409] Neuraminidase (sialidase)
TIGRFAM ID: [TIGR01451] conserved repeat domain
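
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the annotated CDS includes its stop codon. Neither assumption is stated in the record itself, and the variable names are hypothetical.

```python
# Values copied from the Gene Information fields.
start_bp = 576730
end_bp = 579777

# Assumption: coordinates are 1-based and inclusive, so the span
# covers (end - start + 1) bases.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS includes a 3-nt stop codon that is not translated.
stop_codon_nt = 3
protein_length = (gene_length - stop_codon_nt) // 3

print(gene_length)     # 3048
print(protein_length)  # 1015
```

Under these assumptions the computed values agree with the Gene Length (3048 bp) and Protein Length (1015 aa) fields.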


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTGGGACA AGTCGCTATG CCCGACACGT TTTCGCACTG CTGCCCTAGG GCGGCTACCT 
GAAAGGACAG TGGATTTGAA ACTCAAAAAG AGGGAGCCGG CCGGCAGGGC CGGCTCCGTG
GGCCGGGTGG CCGCCGCAGG GCTCCTTGGG ATGGCGCTGA TCGCGGGACC CGGGTTGCCG
GCCAGGGCCG AGCCGGCTCC GCCCTCCAAT CCGGCAGCCG CCCCGGGCAC CTTCGCGGAA
GCGAACATTG CCGCGGACCG GACGGCCGCC AATTTCTTTT ACCGTATTCC CGCGCTCACC
TACCTCGGGA ACGACGTTGT ACTTGCAGCG TGGGACGGCA GGCCCGGGAG CTCGGCCGAC
GCGCCGAACC CGAACTCGAT CGTGCAGCGC CGCAGTATCG ACGGCGGCGC AACGTGGGGT
CCTTTGACCG TCATCGCTGC CGGCCATGTG GCTGATGCCA GCGGCCCCAA ATACGGGTTC
AGTGATCCGT CGTACATCTA CGACGCTGAG GCGAACAAAG TGTTCGCCCT GTTCGTTTAC
TCAAAGGATG CCGGCTTCTC TGCCAGCACC TACGGCAACG ACGACGCCGA CAGGAATGTC
ATTTCCTCGG CCGTGGTGGA GTCCGCCGAC GAAGGCCGCA CCTGGAGCCA GCCCCGGTTC
ATCACAAGCG TCACGAAACC CGGAAGCAGT AAGACCAACC CGCAGCCGGG TGACGTACGC
ACCAACTTCG CGGCATCCGG TGAGGGGATC CAGCTCAAGT ACGGCGCTCA CAAAGGCCGG
TTGATTCAGC AGTACTCGGG TTACGTGCGT CAAGCCAACG GTTCGGAACT CTTCCAGGCC
TACAGCGTCT ATTCAGATGA CCATGGCGCA ACGTGGCACA AAGGGGCCCC GATCGGCGAC
CGCATGGACG AGAACAAGAC CGTGGAACTC TCCGACGGCA GGGTGCTGCT GAATTCGAGG
GACAGCGGGA ACGGCGGCTA TCGCAAAGTG GCCGTGTCCA CCGACGGCGG CGCCAGCTAC
GGGCCGGTTA CGCAGGACAC CGAACTGCCG GACCCTGCCA ACAACGGGTC AATCTCCCGG
ATGTACCCGG CCGCACCGGA GGGCTCAGCC GAGGCAAGGA AGCTGATCTT CACCAATTCC
AACTCCAAGG CCGCCAGGGA AAACGTCTCG GCGCGGGTGT CCTGTGACGA CGGAGCAACG
TGGCCCGGTG TCCGCACCAT CCGTCCCGGC TTCTCCGCGT ATTCAACCAT TACCCGCCTG
GCCGAGGGCA AGTTCGGCGT CCTGTACGAG GCGAACTACA CGGACAACAT ACAGTTCGCC
AGTTTCGACG ACGCCTGGCT GAACTATGTC TGCGCTCCCG TGAACGTGCC CGCACAAACC
ATTGCGCCCG GTGTTGCGCA GCAGGTTCCG GTGACAGTTA CCAACCAGGA AGCCCACGTC
CTGTCGGGCG CCCGGGCCAG TATCTATACG CCAGCGGGAT GGTCCGCCGC CACTGTGGAC
GTTCCTGACC TTGCAACGGG TAGCTCGGCC ACGGTGAACG TCCAGCTCAC ACCGCCGGCC
GGAGCTTCGG GTCCAACTTC CCTCAATGCG GCTTTCACCA CTGCCGACGG AAGAGTGTCC
CAGTACACGT TCGTTGCCAA CAGTCCGGTA GCTCCCCAGG TTGGCCTGAC CATCGCAGGC
TCAGCGCCGG CACGGGACGT GGCGGCGAAC CCGTACAAGG AAGGCGAGGT GCTGTCTTAC
ACCTTCGCGG TCAAGAGCAC GTCGAACGTC ACGTCCAATG CCGTCCCCCT TTCCGGGACC
TTCGAGACCG GGTTCCTGCC GCCGTCGGCC CCTAACTGCC GGTACAACAA CCTTGCCGCC
GGTGCCAGCT ACAACTGCAC GACGCCTAAG CACACGCTTA CTCCTGAAGA CATAGCGCGC
GGCTACCTCG TCCCTGTGGC TGAGTTCACC GTCACGGCCT CCGGCAATAC GGCACTGACG
AAGGCAGTGT CCTTCAAAGG AGCAGCCGTA CCGTTGCGGG ATGGCCTGCT GGCCGGATCG
ATCAGCGGTG CCCGGAATGA TGCCGGACGT GACCTCGCCG TGCAGCCGTA TGCAGCCGGC
GAGCAGGTGC CCTACACGTT TACCGTCAGC AACACCGGCC CCCTGGCCGC GGACGTTGTG
CCGATTGCCG GCAATTTCTC ACCCCTCGTA CCCCCGGGCG CGGGAAACTG CCGGTGGCTC
AACCTTGCCG CGGGAGGATC CTACGCATGC TCCACACCGC GGCACACCGT GACCCAAAAA
GAGGCGGAGG AGGGATTCTT CCGTGCCGAC TCCACTTGGA CAGTTGCTGC GTCCGGGCAG
AGCAGCCGGG AATACCGTGT GGACGGCGGC GAAGTGGACC TCGCGATCCG GAACCCGAAG
CTGGACGGCA CGATCTCGGC TGAATGGGCC GATGCCGACG GCGACCGCTA CGCGAGTGCC
GGGGATTCCG TCACCTACAC CTACGGCGTG GGAAATGCCG GCAATGTCGC GCTGACCGGC
GTCACGGCTA CGGATGCCGG CATTTCAGTG GACAGGCTGG GCATCGGGGA GACAGCAACG
GCAACCAGGG TGCACATCCT GACTCCCGCA GATATCGCGG CCGGCCAGTT GCCGGCCTCT
CCGTTTGCCG CCTCTGCATC CAACGGGTCG CGGAACGTGC GCGTTGACGT GCAGGCCGGA
GCGGTGGCCC TGCGGCTTCA GCCAGCCAAA CCGGCGGCCG TTCCGGTGTT GACGGTCCAG
GATTTCGACG GGCAGGTTCC GCCCGTCGAC CTGGACACCA ATGAAAAATA CCGTAACGGC
GAGAAGGTGA CGCTCCGCGG CCTTCCCCAC GGCCAGTGGT ATTACGTCTA CCTGAACAAG
CACGGCTTCC GCCTCGGCTG GATCTTTCCC ACCACGGCGG ACACGGTGGA GTTCCTCCTG
CCCTCCACTG TGCAGAACGG GCGGGACGAC GTGGTGGTCC TGGATTCCGA AGGGAAGCAG
GTTTCCTTTG ACCGACTTCA GGTCACACCG AAAGGGTCCA TCGGCTGA
 
Protein sequence
MWDKSLCPTR FRTAALGRLP ERTVDLKLKK REPAGRAGSV GRVAAAGLLG MALIAGPGLP 
ARAEPAPPSN PAAAPGTFAE ANIAADRTAA NFFYRIPALT YLGNDVVLAA WDGRPGSSAD
APNPNSIVQR RSIDGGATWG PLTVIAAGHV ADASGPKYGF SDPSYIYDAE ANKVFALFVY
SKDAGFSAST YGNDDADRNV ISSAVVESAD EGRTWSQPRF ITSVTKPGSS KTNPQPGDVR
TNFAASGEGI QLKYGAHKGR LIQQYSGYVR QANGSELFQA YSVYSDDHGA TWHKGAPIGD
RMDENKTVEL SDGRVLLNSR DSGNGGYRKV AVSTDGGASY GPVTQDTELP DPANNGSISR
MYPAAPEGSA EARKLIFTNS NSKAARENVS ARVSCDDGAT WPGVRTIRPG FSAYSTITRL
AEGKFGVLYE ANYTDNIQFA SFDDAWLNYV CAPVNVPAQT IAPGVAQQVP VTVTNQEAHV
LSGARASIYT PAGWSAATVD VPDLATGSSA TVNVQLTPPA GASGPTSLNA AFTTADGRVS
QYTFVANSPV APQVGLTIAG SAPARDVAAN PYKEGEVLSY TFAVKSTSNV TSNAVPLSGT
FETGFLPPSA PNCRYNNLAA GASYNCTTPK HTLTPEDIAR GYLVPVAEFT VTASGNTALT
KAVSFKGAAV PLRDGLLAGS ISGARNDAGR DLAVQPYAAG EQVPYTFTVS NTGPLAADVV
PIAGNFSPLV PPGAGNCRWL NLAAGGSYAC STPRHTVTQK EAEEGFFRAD STWTVAASGQ
SSREYRVDGG EVDLAIRNPK LDGTISAEWA DADGDRYASA GDSVTYTYGV GNAGNVALTG
VTATDAGISV DRLGIGETAT ATRVHILTPA DIAAGQLPAS PFAASASNGS RNVRVDVQAG
AVALRLQPAK PAAVPVLTVQ DFDGQVPPVD LDTNEKYRNG EKVTLRGLPH GQWYYVYLNK
HGFRLGWIFP TTADTVEFLL PSTVQNGRDD VVVLDSEGKQ VSFDRLQVTP KGSIG
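
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of one way to do that and to compute GC content. The helper names `clean_sequence` and `gc_percent` are hypothetical and are not part of any database tooling.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in dna if base in "GC")
    return 100.0 * gc_count / len(dna)


# First line of the gene sequence from this record, used as a short demo.
first_line = (
    "ATGTGGGACA AGTCGCTATG CCCGACACGT "
    "TTTCGCACTG CTGCCCTAGG GCGGCTACCT"
)
dna = clean_sequence(first_line)

print(len(dna))  # 60
print(f"{gc_percent(dna):.1f}% GC")
```

Applied to the full gene sequence, the same two helpers give a length and GC percentage that can be compared with the Gene Length and GC content fields of the record.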