Gene Arth_0189 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_0189 
Symbol 
ID4447347 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp194057 
End bp195826 
Gene Length1770 bp 
Protein Length589 aa 
Translation table11 
GC content68% 
IMG OID639687984 
Productexo-alpha-sialidase 
Protein accessionYP_829690 
Protein GI116668757 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG4409] Neuraminidase (sialidase) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACATCCC ACCTCTTTTC TTCAGCCGAT GGCATTCCAA TGGCGCGTCC CGCCACGGAA 
CACGTACTGG CTGAACGCGG CAGGGGCGGC TACCGGCAGT ACAGGATTCC CGCGATGGCT
GTGACGGCCC GCGGCACCCT GCTCGCCGCC TATGACGGAC GACCCAACCT TGATGATCTT
CCCAGCCCGA TAGATCTTCT GCTACGGCGC AGCGGGGACA GCGGCCGGTC CTGGCAGGCC
CAGCAGGTGG TGCGCACCGG CGTCGGGCTT GAAGGGTACG GGGATCCAAG CCTGCTGGTG
GACGCCGAAA CGGGCCGGAT CTTCATGTTC CACGCGGCCG GCACTCGCGC AGGCTTCTTT
GAAGCCGCCG AAGGAGCGGG GAACGACGGC GACGTCCAGC ACTGCGACGT CAGCTACTCC
GACGACGACG GGCTGACCTG GCGGCACCGC CGACTCACGG ACCAGCTCAA GCTCGCTGCC
GGCCGGGGCC GCGGAATTAC CGGAATCTTC GCCGCGGCGG GGCAGGGCAT CCAAATCCAT
GCCGGGCCGT TCTCCGGGAG GCTCGTCCAG CAGTATGTGG TCCTGGTGCA GGGGAACATT
TTGGCGGCGT CCGCCTACAG CGACGACCAC GGCGAAACCT GGACGCTGGG GGAGTTCATC
GGCCCTTCGA CGGCGGGTGG GGAAGCAGGC GCGCCGACAG CTGGCGAGGC GGCACCCAAC
GAGAACAAGG TCGCCTGCCT CAACGACGGC AGGCTCCTGC TCCACAGCCG CGCCACTCCA
CGCCGGCTAT CAGCGCTCTC CGGCGACGGC GGGCACAGCT GGAGTCCGCT GACCGCCATT
GCGGACCTTC CCGATCCCAG CGACAACGGC TCGGTGACCC GATTTGACGG TCTGCCGCTT
CCGCTAGGCC ACGCCACCGC AGAAACGGAC GCGTGGGTGC TGGCAACCAA CAACCACGAC
ACCGCGTTGC GCCGCAACAC CGTGCTCAGC CTCTCCCGGG ACAACGGGGC CAGCTGGCCG
GCGAAGCTCG TCATTTGCCC GGGCAGCTCG GCGTATTCAA CGGCGGCCAG GCTGCCCGAC
GGGAACATCG GAGTCCTCTA CGAGCGCCAG GGCTACCGGG AAATTGTGTT CTGCTCCATT
CCGCCGGCCC AGCTCACGGA TGCCGCCGCA CTCCGTCAGG GCAACCGCCG TGACGCCCGC
CAGGACTCTG CTCGGCCAGA CCAGCCTTCC GGAAGGGCTT CCGGCCCCCC GGCATCGGGG
ATGGTATTCG ATATGGAACT GCGCTCCATC ACGCCCGGCC GTCCCGAGGT GTGGCAGAAC
GCCGGCGAAT TCCACGTGCT GGCCGCCGAT GCGGGGCAGT GGGACGTCCA CACGTGGAAG
GAGATCGGCC AGGGGTACTC GCCGGAGTCA GCCCAAGTGG TCGGAACGCG CGAGGCGCAG
GACCTGAACT ACGGGCCGGT CTCCACCGGC TACAAGGCGG GGGACATCCT TGCCTTCACC
GGGCGCGCAC GCAACAGCGG CCCGGAGGTG TTGGCCGGCG TGCGGCTGAC TGGTCCGGGG
GCGGAAGAGT TCGGCCCGGC AGACCTGCAG CCCGGCGAGG AATCGCTGTA CTTCACCCCT
GTCTGCACCG TGACGGCGCA GGATGTGGGG CGCGGTTCGC TGGACGTCAG GTACGGGGTG
GAGTGGACGG CGGCCGGCGT GAACGGGCGG CTGGAGCGCC GGTTCACGTT CAACATGGCA
GACGGCAGCG TGGACATTGG CCCGGTGTAA
 
Protein sequence
MTSHLFSSAD GIPMARPATE HVLAERGRGG YRQYRIPAMA VTARGTLLAA YDGRPNLDDL 
PSPIDLLLRR SGDSGRSWQA QQVVRTGVGL EGYGDPSLLV DAETGRIFMF HAAGTRAGFF
EAAEGAGNDG DVQHCDVSYS DDDGLTWRHR RLTDQLKLAA GRGRGITGIF AAAGQGIQIH
AGPFSGRLVQ QYVVLVQGNI LAASAYSDDH GETWTLGEFI GPSTAGGEAG APTAGEAAPN
ENKVACLNDG RLLLHSRATP RRLSALSGDG GHSWSPLTAI ADLPDPSDNG SVTRFDGLPL
PLGHATAETD AWVLATNNHD TALRRNTVLS LSRDNGASWP AKLVICPGSS AYSTAARLPD
GNIGVLYERQ GYREIVFCSI PPAQLTDAAA LRQGNRRDAR QDSARPDQPS GRASGPPASG
MVFDMELRSI TPGRPEVWQN AGEFHVLAAD AGQWDVHTWK EIGQGYSPES AQVVGTREAQ
DLNYGPVSTG YKAGDILAFT GRARNSGPEV LAGVRLTGPG AEEFGPADLQ PGEESLYFTP
VCTVTAQDVG RGSLDVRYGV EWTAAGVNGR LERRFTFNMA DGSVDIGPV