Gene Slin_3975 details

Gene Information

Locus tag: Slin_3975
Symbol:
ID: 8727733
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Spirosoma linguale DSM 74
Kingdom: Bacteria
Replicon accession: NC_013730
Strand:
Start bp: 4773420
End bp: 4775237
Gene Length: 1818 bp
Protein Length: 605 aa
Translation table: 11
GC content: 54%
IMG OID:
Product: Beta-N-acetylhexosaminidase
Protein accession: YP_003388764
Protein GI: 284038834
COG category:
COG ID:
TIGRFAM ID:
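
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene ends in a single stop codon that is not counted in the protein length; neither convention is stated in the record itself. The function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4773420
end_bp = 4775237
gene_length_bp = 1818
protein_length_aa = 605


def span_length(start, end):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(length_bp):
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


print(span_length(start_bp, end_bp))   # 1818, matches Gene Length
print(coding_codons(gene_length_bp))   # 605, matches Protein Length
```

Under these assumptions both derived values agree with the reported Gene Length and Protein Length.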


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0128397
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 35
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTACAAAC TCATCGCGTT GCTTCTGCTG GCCTTCACCG CATCGGCTCA AATGCCGATT 
CCGGCGCTGA TTCCGGCTCC CCAAACGCTC GAACCCGGTA CCGGATCGTT CGCTATAACC
GCCCAAACCC GATTAGCCAT TCTCACGCCC GTAGCCGACG TTCGGCGGAT GGTGATGGAC
AATCTCCCCG GTATTCCGGC ATCTGATGCC CCGAAATTAA CCAGAGCCAT TACGGTTCGG
CTGGCTCCCG TGGCGGGTAT CGGACCGGAA GGTTATGACC TGGTAATTAC GCCAACTGGT
GTTACACTGA CGGCACCCGA AGCGGCTGGG TTGTTTTACG GTCTTCAAAC CATGCGGCAA
CTGATGCCCG TTGCTAAAAC GGTCCGCGGG CAGTCGATAC CGGCTCTGCA CATCCGCGAC
CAGCCCCGCT TTGGCTGGCG GGGGCTGATG CTCGACGTGA GCCGCCATTT CTTCGACAAG
CAGTTCGTGA AGCGGTACAT CGACCAGATG GCAACGTATA AATTCAACAT ATTCCACTGG
CATTTGTCCG ACGACCAGGG CTGGCGAATT CAGATCAACA GTTTACCTAA ACTAACCGAA
ATAGGTGCAT GGCGCGTACC GAGAACGGGC AGCTGGAACG AGATTGAAAA TCCACAGCCG
GGCGAAGTGC CATCGTACGG CGGTTTTTAC ACACAGGACG ATATTCGTGA AATCGTTCAG
TACGCACAGC AGCGTAACAT TACTATTGTG CCCGAAATTG ATATGCCAGG GCATATGATG
GCGGCTATTG CGGCTTATCC AGCCCTGACC TGCGGCCAGA AACAGGTGCT TGTGCCTACC
AATGGTAAGT TCTATAAAGT GGAGGACAAC ACCCTGAATC CATGTAACTA CGGCACGTAC
CTGTTTATCG ACAAGGTTCT GACGGAGATT GCCCAACTAT TTCCCGGACC TTATATTCAC
ATTGGGGGCG ACGAAGCCTA CAAAGGGTTT TGGTCTGGCT GCGAAGAGTG CAAGACAACT
ATGACGGTCA ATAACCTCAA AACGGTGGAG GAACTGCAAA GCTATTTCAT CCGGCGGGTG
GAGAAAATCG TACAGTCGAA AGGCAAAAAA CTCATTGGCT GGGATGAAAT TCTGGAGGGT
GGCTTAGCTC CCAACGCTAC GGTTATGAGC TGGCGGGGCA TGAAAGGTGG GATTGAGGCC
GCCAAACAAG GTCATCCCGT TATCATGACG CCCGCTCAGT TTTGTTACCT CGACCTCTAT
CAGGGCGAAC CCTCTGCCGA ACCCAGCACC TACAGCATGG CCCGGTTAAG CACTTCCTAT
TCCTTCGAGC CGGTCCCCGA CAGCGTACGC GCCGACCTGA TTCTGGGCGG GCAAGGCAAC
CTCTGGACGG AATCCGTACC CAACAACCGC CATGCTGAAT ACATGACCTG GCCACGCGCC
TTTGCGATAG CCGAGGTACT CTGGTCGCCG AAGAACCAGC GCAACTGGCC GGACTTCATG
AAACGGGTAG AAGCTCATTT CAAACGGTTC GACGCGCAGG ACGTAAATTA CGCCCTTAGC
GTGTACAACC CCATCGTTAG CCTGAAAAAG CACCCGATGG GTATGATGGA AGTTACGCTG
GGCCACGAAC TGCCCGATAC CTACCTCTAT TATTCAGTAG ATAACACCAT TCCCGACGAT
CATTTCCCTT TGTATACCCA ACCCTTCCAA ATGCCCAAAG GAGCCGAGCG GGTAAAAGTG
ATCGCTTACC GCAACGGAAA ACCCATTGGA AAACTAGTGG AGGTGATGAT GCCTAAAGTG
GAGAAAAAGG TTCAGTAA
 
Protein sequence
MYKLIALLLL AFTASAQMPI PALIPAPQTL EPGTGSFAIT AQTRLAILTP VADVRRMVMD 
NLPGIPASDA PKLTRAITVR LAPVAGIGPE GYDLVITPTG VTLTAPEAAG LFYGLQTMRQ
LMPVAKTVRG QSIPALHIRD QPRFGWRGLM LDVSRHFFDK QFVKRYIDQM ATYKFNIFHW
HLSDDQGWRI QINSLPKLTE IGAWRVPRTG SWNEIENPQP GEVPSYGGFY TQDDIREIVQ
YAQQRNITIV PEIDMPGHMM AAIAAYPALT CGQKQVLVPT NGKFYKVEDN TLNPCNYGTY
LFIDKVLTEI AQLFPGPYIH IGGDEAYKGF WSGCEECKTT MTVNNLKTVE ELQSYFIRRV
EKIVQSKGKK LIGWDEILEG GLAPNATVMS WRGMKGGIEA AKQGHPVIMT PAQFCYLDLY
QGEPSAEPST YSMARLSTSY SFEPVPDSVR ADLILGGQGN LWTESVPNNR HAEYMTWPRA
FAIAEVLWSP KNQRNWPDFM KRVEAHFKRF DAQDVNYALS VYNPIVSLKK HPMGMMEVTL
GHELPDTYLY YSVDNTIPDD HFPLYTQPFQ MPKGAERVKV IAYRNGKPIG KLVEVMMPKV
EKKVQ
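
The sequences above are printed in space-separated blocks of ten residues, so they need to be joined before any length or composition check. A minimal sketch of that clean-up, with a GC-fraction helper for the nucleotide sequence, follows. The function names are hypothetical, and only the first line of the gene sequence is used as sample input.

```python
def clean_sequence(text):
    """Join a block-formatted sequence by removing spaces and line breaks."""
    return "".join(text.split()).upper()


def gc_fraction(dna):
    """Fraction of G and C bases in a DNA string (0.0 for an empty string)."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence, copied from the record.
first_line = "ATGTACAAAC TCATCGCGTT GCTTCTGCTG GCCTTCACCG CATCGGCTCA AATGCCGATT"
dna = clean_sequence(first_line)

print(len(dna))                    # 60 bases: six blocks of ten
print(round(gc_fraction(dna), 2))  # GC fraction of this excerpt only
```

Applied to the full gene sequence, the same helpers would allow the reported Gene Length and GC content to be compared against the sequence text.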