Gene Slin_2031 details

Gene Information

Locus tag: Slin_2031
Symbol:
ID: 8725769
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Spirosoma linguale DSM 74
Kingdom: Bacteria
Replicon accession: NC_013730
Strand:
Start bp: 2453649
End bp: 2455937
Gene Length: 2289 bp
Protein Length: 762 aa
Translation table: 11
GC content: 53%
IMG OID:
Product: Beta-N-acetylhexosaminidase
Protein accession: YP_003386875
Protein GI: 284036945
COG category:
COG ID:
TIGRFAM ID:
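
The start, end, and length fields in this record can be cross-checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon. Both are assumptions, although they agree with the values listed here. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length // 3 - 1


length = gene_length_bp(2453649, 2455937)
print(length, protein_length_aa(length))  # prints: 2289 762
```

Under these assumptions the computed values match the Gene Length and Protein Length fields of the record.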


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.250491
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 29
Fosmid unclonability p-value: 0.492134
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAACATC TGCTTTTTCT GCTTCTCCTG TCCAGTACTG TCTTTGCCCA ATCTGAAAAC 
GAATACAACC TTATTCCCTT TCCTGCCCGG TTTTCAGGGC AAAACGGCCA GTTTTCCCTC
TCGGCAACAA CCCGAATCGT CGTATCAGAC CCAACGGTAA AGGCGGTTGC CCAGACATTT
GCCAGTCAGG TCAAAGCCGC TACGGGCATT ACCCTTACCG TAGCGTCGGC CAGTCCGGCA
CTGGCCAAAG GCGCAAATAT TTTCTTTACC CTGAACAAAA AACTCACCCT AGGCGACGAG
GGCTATAAAC TGACCGTTAC CCCTACCCGT GTTCTGGCCG AAGCCTCGAC ACCCAAAGGG
TTGTTTTATG CGGCCCAGAC GATACGGCAA TTGATACCTG CCGGTGCATC GTCTACTGCC
GCGCTGCCTG CCTGCGCCAT TACGGACAAA CCGCGATTTG GCTACCGGGG CCTAATGCTC
GATGTAGGGC GTCATTTTAT GCCGGTTGCA TTCGTCAAGA AGTTCATCGA CCTGATGGCA
ATGCACAAAC AGAACACCTT TCACTGGCAT CTGACCGAAG ATCAGGGCTG GCGAATCGAA
ATAAAGAAGT ATCCTAAATT AACGCAGATC GGTAGCAAAC GAGCCGAATC CATTGTCGGT
CAGTATTATC AGAACTACCC CCAGCAGTTT GACGGTAAAC CCGTTTCGGG ATTCTACACG
CAGGAGGAAA TTAAAGATGT GGTCCGGTAC GCGCAGAGCC GGTTTGTGAC CATTATTCCC
GAAATTGAAA TGCCCGGTCA TGCTCAGGCT GCCCTGGCCG CGTACCCTGA GCTGGGCTGC
GACCCCGCCA AAGGTTATCA GGTATTCACA AAATGGGGCG TTTCGGAAGA CGTGTACTGC
CCGTCGGAGA AAACGTTTAC GTTCCTGCAG GACGTGTTGA CGGAAGTCAT CGCCTTGTTT
CCGGGAAAAT ACATTCACAT TGGGGGCGAT GAGTGCCCCA AAACGGCCTG GAAACAAAGT
GCCTTCTGCC AGGAGCTGAT GAAGAAGAAC AATCTGAAGG ATGAGCATGA ACTCCAGAGT
TATTTTATCC GGCGTGTTGA AAAGTTCCTG AACAGCAAAG GCCGCTCCAT CATCGGCTGG
GACGAAATTC TGGAGGGCGG ACTGGCCCCC AATGCGACCG TGATGAGCTG GCGCGGTACC
GAAGGCGGCA TTGCCGCGGC CAAACAGAAG CATAACGTGA TCATGACACC CGGCGGCACC
TGCTACCTCG ACCATTATCA GGGGAATCCG GCCACCGAGC CGCTCGCCAT TGGCGGCTAT
CTACCCCTCG ACAAAGTTTA CGGCTATGAG CCCATGCCTA CCGAACTGAC GGACGCCGAG
CAGAAATACG TGCTGGGTGT ACAGGGGAAC ATCTGGACAG AATACATGCC CACGTCGGAA
TCTGTCGAGT ACATGGCATT TCCCAGAGCC ATCGCCTTAG CTGAGATCGG CTGGATGCAA
GCGGGCACGC ATAATTTTGA GGATTTTAGC CAGCGACTCA AAAATCACCT GCCCCGGCTT
AAAAACGTGA ACTATGCCAA ACGCCTGTTC GACATTACGG CCAGTACACA AGCCGGTGAT
CAGGGCCAGA TTCAGGTCGT TCTGAAAAAA CTGGACAGCG ATAGCCGAAT CGTTTATACG
ACCAATGGGA AAGAACCGAA TGAGCAAAGT CCCGAATACA TTGGCCCCAT TACACTGACC
AAGACAACGA CTATTCGGGC TAAGACATGG ACTGGTGGGC AGCCAACCGG CGGCCAGCTT
ACCCAAACGT TTGTGTTGCA CAAAGGCAAG AACAAACCCT ACACATATGG CACACCGCTG
GATAAGTACA GTGATCCAAA ATCATCGAAA TTAACAGATG GCGTTCGGGG TGATACCCCC
CGGAGTCGGC AGGAATGGGT AAACGTGTAC GGCAACGACA TGGACGTTAC CCTCGACCTC
GGCAATGTGA CGAGCGTGAC CAAAGTATCG CTGAACTTCC TCAAGGTCAT TCTGGAGAAA
GGCTTCCCGC CAAAGTCGGT GGAAATTGCC TTGTCGAAGG ATGGCAGCGA CTTTAAAGAA
GCCATTGCAC AGCCCGTAGT GTATGAGTTA AACGGCCCCT GGGCTATTTT ACCCGTTGTT
GCGGACTTCA AAACAGCCCG AGCCCGTTAT GTTCGAATCC GGGCAAAGAA TGCCGGTGTT
TGCCCACCTG AACACCCAAA TGCGGGTGAA AAAACCTGGT TTTCCATCGA TGAAATTGTG
GTGGAATAG
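
The gene sequence is displayed in space-separated blocks of ten bases, so the whitespace has to be removed before any length or composition check. The following is a minimal sketch of that step, with the GC fraction computed as (G + C) / total bases. The helper names are hypothetical, and only the first row of the sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First row of the gene sequence as displayed in this record.
first_row = "ATGAAACATC TGCTTTTTCT GCTTCTCCTG TCCAGTACTG TCTTTGCCCA ATCTGAAAAC"
seq = clean_sequence(first_row)
print(len(seq))                    # prints: 60
print(round(gc_fraction(seq), 3))  # GC fraction of this row only
```

Applying the same two functions to the complete sequence would allow a comparison with the Gene Length and GC content fields.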
 
Protein sequence
MKHLLFLLLL SSTVFAQSEN EYNLIPFPAR FSGQNGQFSL SATTRIVVSD PTVKAVAQTF 
ASQVKAATGI TLTVASASPA LAKGANIFFT LNKKLTLGDE GYKLTVTPTR VLAEASTPKG
LFYAAQTIRQ LIPAGASSTA ALPACAITDK PRFGYRGLML DVGRHFMPVA FVKKFIDLMA
MHKQNTFHWH LTEDQGWRIE IKKYPKLTQI GSKRAESIVG QYYQNYPQQF DGKPVSGFYT
QEEIKDVVRY AQSRFVTIIP EIEMPGHAQA ALAAYPELGC DPAKGYQVFT KWGVSEDVYC
PSEKTFTFLQ DVLTEVIALF PGKYIHIGGD ECPKTAWKQS AFCQELMKKN NLKDEHELQS
YFIRRVEKFL NSKGRSIIGW DEILEGGLAP NATVMSWRGT EGGIAAAKQK HNVIMTPGGT
CYLDHYQGNP ATEPLAIGGY LPLDKVYGYE PMPTELTDAE QKYVLGVQGN IWTEYMPTSE
SVEYMAFPRA IALAEIGWMQ AGTHNFEDFS QRLKNHLPRL KNVNYAKRLF DITASTQAGD
QGQIQVVLKK LDSDSRIVYT TNGKEPNEQS PEYIGPITLT KTTTIRAKTW TGGQPTGGQL
TQTFVLHKGK NKPYTYGTPL DKYSDPKSSK LTDGVRGDTP RSRQEWVNVY GNDMDVTLDL
GNVTSVTKVS LNFLKVILEK GFPPKSVEIA LSKDGSDFKE AIAQPVVYEL NGPWAILPVV
ADFKTARARY VRIRAKNAGV CPPEHPNAGE KTWFSIDEIV VE
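
The protein sequence can be checked against the gene sequence by translating codon by codon. The sketch below builds the codon table from the compact NCBI-style amino acid string. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in which codons may act as starts. The function name is illustrative, and the sample input is limited to the first and last rows of the gene sequence.

```python
from itertools import product

# Codons enumerated in T, C, A, G order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, ignoring whitespace."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


first_row = "ATGAAACATC TGCTTTTTCT GCTTCTCCTG TCCAGTACTG TCTTTGCCCA ATCTGAAAAC"
print(translate(first_row))    # prints: MKHLLFLLLLSSTVFAQSEN
print(translate("GTGGAATAG"))  # prints: VE*
```

The first output matches the first twenty residues of the protein sequence, and the second shows the final two residues followed by the stop codon.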