Gene Slin_3893 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_3893 
Symbol 
ID8727651 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp4669658 
End bp4671940 
Gene Length2283 bp 
Protein Length760 aa 
Translation table11 
GC content57% 
IMG OID 
ProductBeta-N-acetylhexosaminidase 
Protein accessionYP_003388682 
Protein GI284038752 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value0.470709 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAAATCA AACCGATCAT TTTCATCGTC GGGCTCAGTC TGCTGATGGC CGGTATAGGT 
CAATCGCAAA CGCCCGATCC ATACCCCATT ATCCCGTACC CGACTTCGCT GGTACCGGGT
CAGGGGCAGT TTGTTATCAC CCAAAATACC GCGCTGGTTG TCCAGGATGG TCGTTTCCAG
AGCGAGGCCG GTCAGTTACA GCAATTGCTG AAACCGGTTT TGGGCAAGCC ACTGCCTGCC
CGTGGAGGCA ACGCGCAGAT CGTGCTCAAT TACGACCCAT CCATCACGTC ACCGGAAGGC
TACCAGCTTA CCATCGCCCC CCAGCGCATA ACCCTGGCGG CCAGAGAGCC GGTGGGCATG
TTTCGGGCCA TTCAGACCAT CCGGCAACTG CTGCCCGTGA GCCTTGAGCA GAAAAAAACA
ACCGGCCCAC TGACCCTCCC GGCGGTGCAG ATCCGCGACC AGCCCGCCTA TGCCTGGCGG
GGCATGCACC TCGATGTGTC GCGGCACTTT TTCTCGATGG ATTACCTGCA TAAATTCGTC
GATCTGCTGG CTCTTTACAA GTTTAATAAA TTTCACCTGC ACCTCACCGA CGACCAGGGC
TGGCGGCTGG AGATAAAGGC GTACCCCAAG CTAACCAGCG AAGGAGCCTG GCGGACGTTC
AATAACCAGG ATTCTGTCGT GCTGAAACGG GCCACCACAA ATCCCGACTT CGACCTGCCG
AAGCAATACC TCCGGCAGAA AGACGGGAAG ACGCAATATG GCGGGTTTTA TACTCAGAAC
CAGATGCGCG AACTCATCGC GTACGCAGCC GCCCGCCATA TCGAAATCAT TCCGGAAATC
GACATGCCCG GCCATTTGAC GGCGGCCATC AAAGCGTATC CATTTCTAAG CTGTACGGGT
CAGGAAGGCT GGGGCAAAAC GTTTTCGGTG CCCATCTGCC CCTGCAACGA ACCCACCTAC
ACGTTTACCG AAACCGTACT GAGCGAAGTA GCCGCGCTGT TCCCGAGCCA GTACATCCAC
ATCGGGGCCG ATGAAGTGGA GAAATCGACC TGGGCGCAGT CGACGGCCTG TCAGGCGTTG
ATGAAGCGGG AGGGAATCAA AAGCGTGGAG GAACTGCAAA GTTATTTCGT ACACCGCACC
GAAAAGTTTC TGCTGTCGAA AGGGAAAAAA CTCATGGTCT GGGACGACGC TCTGGAGGGC
GGACTGGCAC CCTCGGCCAC GGTCATGTAC TGGCGGAGCT GGGTGACCGA TGCGCCCGTG
AAAGCTGTTC GGAATGGCAA CCCGGTGGTT ATGACGCCCG TTAACACCCT TTATTTCGAC
GTGTTGCCCG ACAAAAATTC GCTCGCAAAT GTCTATCAAT TCAATCCCGT TCCAACCGGG
CTGACACCCG CCGAAGCGAC ATCCATTCTG GGCGCACAGG CCAACACCTG GACGGAATAT
ATTCCCTCCG AAAATCGGGT CGATTACATG GTGATGCCCC GCATGACGGC CCTGGCCGAA
CGGCTGTGGA CCAATCAGAA TCAGTATGAC ACTTACCGGC AGCGCCTTAC CCGGCACTAC
CCGCGTCTGG ATGCGCTGGG GGTTCACTAC CGCGTGCCCG ATCTGAGCGG CTTTGCCGAA
GAGAATGTGT TTACGGACCA AACGGCCCTG CGCATCCGCA AACCGACGGA TAATCTGGTT
GTTCGCTACA CCATCGATGG GAGCTTGCCG AAGGCGGCTT CGGCGCTACT TCCCGAGTCG
TTGCCGATCA GCCAGCCAAC GACGGTAAAG CTGGCGGCTT TTACGAATAG CGGCTTGCAG
GGCGATGTGT ATACGCTTCG GTATCAACAG CAATCGCTTG CTGAACCGGT GGGAGTATCG
TCCGTCGGGG CCGGATTGAT GAGTACCTAT GTGAAGGGAC AGTTTAAAAA TGTAGCCGCC
ATGCTAAAGG CACCGGCTTC CGATTCGGTG GTGGTGAATC AGGTAAAGGT GCCGGAAATG
GCCGGAGCGG GTAGTTTCGG CGTTCGGTTT CGGGGTTACA TCAGCGTTCC CGCCACGGGT
ATTTACAGCT TTTTCCTGGT AGCCGACGAT GGGGGTGTGC TGCACATTGC CAACCGGACG
GTTATCGACA ACGACGGTAA CCACGGCCCC ATTGAAAAAA GCGGTCAGGT GGCCCTCAAA
CGGGGTACGC ATCCTTTCGC CCTCGACTTC ATCGAAGCGG GTGGCGGGTA TACCCTAAAG
CTGCTGTACA GCCGCGACGG TAGTGATCCA CAACCCGTAC CCGCCGACTG GCTGGGGCAT
TGA
 
Protein sequence
MQIKPIIFIV GLSLLMAGIG QSQTPDPYPI IPYPTSLVPG QGQFVITQNT ALVVQDGRFQ 
SEAGQLQQLL KPVLGKPLPA RGGNAQIVLN YDPSITSPEG YQLTIAPQRI TLAAREPVGM
FRAIQTIRQL LPVSLEQKKT TGPLTLPAVQ IRDQPAYAWR GMHLDVSRHF FSMDYLHKFV
DLLALYKFNK FHLHLTDDQG WRLEIKAYPK LTSEGAWRTF NNQDSVVLKR ATTNPDFDLP
KQYLRQKDGK TQYGGFYTQN QMRELIAYAA ARHIEIIPEI DMPGHLTAAI KAYPFLSCTG
QEGWGKTFSV PICPCNEPTY TFTETVLSEV AALFPSQYIH IGADEVEKST WAQSTACQAL
MKREGIKSVE ELQSYFVHRT EKFLLSKGKK LMVWDDALEG GLAPSATVMY WRSWVTDAPV
KAVRNGNPVV MTPVNTLYFD VLPDKNSLAN VYQFNPVPTG LTPAEATSIL GAQANTWTEY
IPSENRVDYM VMPRMTALAE RLWTNQNQYD TYRQRLTRHY PRLDALGVHY RVPDLSGFAE
ENVFTDQTAL RIRKPTDNLV VRYTIDGSLP KAASALLPES LPISQPTTVK LAAFTNSGLQ
GDVYTLRYQQ QSLAEPVGVS SVGAGLMSTY VKGQFKNVAA MLKAPASDSV VVNQVKVPEM
AGAGSFGVRF RGYISVPATG IYSFFLVADD GGVLHIANRT VIDNDGNHGP IEKSGQVALK
RGTHPFALDF IEAGGGYTLK LLYSRDGSDP QPVPADWLGH