Gene Snas_5835 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSnas_5835 
Symbol 
ID8887051 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStackebrandtia nassauensis DSM 44728 
KingdomBacteria 
Replicon accessionNC_013947 
Strand
Start bp6199606 
End bp6200784 
Gene Length1179 bp 
Protein Length392 aa 
Translation table11 
GC content70% 
IMG OID 
ProductExo-alpha-sialidase 
Protein accessionYP_003514558 
Protein GI291303280 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGACTGC TCCGTCCGCT CGTCATCGCG GCCGTGGCCG CGCTGACCGT CACCGCCGGT 
CTGCTCACCG CAGGCCCCGC CTCCGCCGCC CCCGACGTCC AGACCATCTT CACCAAGGGC
GAGAACGGCT ACGGCTGCCA CCGCATCCCC GCCATCCTGC GGGCCGGGAA CGGCGACCTG
CTGGCCTTCG CCGAAGCCCG CACCGAGTTC TGCGGCGACA CCGGCCACAT CGACCTGGTC
ATGAAACGCT CCACCGACGA CGGAGCGACC TGGGGCAAGT CCCAGATCGT GCTACAGGGC
ACCGACGACG ACCCCGACGC GGCCGCCACC CGCGGCAACC CGGTGCCGAT CCTCGACGAG
AGCACCGGCC GCATCGTGCT GCTGTCCACA CACAACCCGT CCAACGCCGA CCAGCCCCGC
ACCCCGTACG TCCAACACAG CGACGACGAC GGCCAGACCT GGAGCACCGC CAAGAGCCTC
GGCGACGTCA TCGACGAACC CGACTGGGGC TGGTATGCCA CCGGCCCCGG CGGCGGCATC
CAGCTCACCC GGGGCGAACA CGCCGGACGG CTGGTCGTGG GCGTCAACTT CTCCGACGGC
TCCGGCAAGA ACGGCGCCGC CCTGGTCTAC AGCGACGACG GCGGCGAGAC CTGGACCCGC
GGTGCCACCG ACGTCCCCGC GACCGACGAC ATCATCCCGC AGGAACTGAA CCTCTTCGAG
CGCACCGACG GCGGCATCTA CGCCGCGGCG CGGGAGAACG CGGGCACCAA CACCCAGACC
CGCGCCTTCG CCGTCAGCAC CGACGCCGGA GCCAGCTTCG AGGCGCCGTT CAAGCTGATT
CCCGACCTCG TCGGCACACC CAAGGTCCAG GGCTCGATCC TTCGCCTGCG CGCCACCGAC
TCCGGCGACT CCTACGACCG GGTGCTGTTC GCGTCCCCTG TGCACTCCAA GCTGCGCATG
ACCATGACGA TCCGCTCGTC CTTCGACGAG GGGAAGACCT GGCAGAGCGT CGACGAGGGC
ACCGTCATCG ACGAGGACCG CGCCGGTTAC TCCAACATGG CCGTTCTGGG CAACGGCGAC
ATCGGACTCC TCTACGAAGC GGGTGCCTAC CCCGACGGCG ACGCCCGCGA CGACATCCGC
TTCGCCCGCA TCAGCGAGTC GGATCTGGGT GTGCCGTAA
 
Protein sequence
MRLLRPLVIA AVAALTVTAG LLTAGPASAA PDVQTIFTKG ENGYGCHRIP AILRAGNGDL 
LAFAEARTEF CGDTGHIDLV MKRSTDDGAT WGKSQIVLQG TDDDPDAAAT RGNPVPILDE
STGRIVLLST HNPSNADQPR TPYVQHSDDD GQTWSTAKSL GDVIDEPDWG WYATGPGGGI
QLTRGEHAGR LVVGVNFSDG SGKNGAALVY SDDGGETWTR GATDVPATDD IIPQELNLFE
RTDGGIYAAA RENAGTNTQT RAFAVSTDAG ASFEAPFKLI PDLVGTPKVQ GSILRLRATD
SGDSYDRVLF ASPVHSKLRM TMTIRSSFDE GKTWQSVDEG TVIDEDRAGY SNMAVLGNGD
IGLLYEAGAY PDGDARDDIR FARISESDLG VP