Gene Slin_4336 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_4336 
Symbol 
ID8728096 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp5257732 
End bp5259393 
Gene Length1662 bp 
Protein Length553 aa 
Translation table11 
GC content55% 
IMG OID 
ProductBeta-N-acetylhexosaminidase 
Protein accessionYP_003389117 
Protein GI284039187 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value0.649243 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAAAT CGTTTCTCCT CCTAATCGCT TTGTGCGTAA CGGCTCTGGC CACCCATGCC 
CAAACCAATA CGTACGCGCT GGTGCCCAAA CCCACTCACC TGGAAGCGCG CAGCGGCAGC
TACACGCTAC CCGCGAAACC AACTATTGCC GTTCAGGCAG CCGATCCTGA AATAAGCCGG
ATTGCCCGGA TGCTGGCCGA TCAGTTAGCC AAAGCAACCG GATCGGCACC GGTTGTGACC
ACAGGTAAAG CCTCCGGCGG CATTTCGTTC GTCAGTGCCA AAGGCCCGAA GCTTGGCGCA
GAAGGGTACA CCTTAACCGT TTCGCCAAAG CAGATTGTCA TTACGGCCGA GCAGCCTAAG
GGATTTTTCT ACGGCGTTCA ATCGCTGATG CAGTTGATGC CCGCAGCCGT TTTCAGCCCG
ACAAAAGTGA GTGGCGTTGC CTGGACTGTA CCGGCCTGCA CCATCGAAGA CCAGCCCCGG
TATGGGTATC GGGGGTCGAT GCTGGACGTT GGGCGGTATT TCTACCCCGT AGCCTTCATT
AAGAAATACA TCGACCTGCT GGCCGTGCAT AAAATGAATA CGTTTCACTG GCACCTGACC
GAAGACCAGG GCTGGCGGAT CGAAATCAAG AAGTACCCAA AGCTAACCGA GGTCAGTTCA
ATCCGGCCTA AAACGATGGT GGGCCACTAC CGGGATAAGG TCTACGACAA CAAGCCGTAT
GGCGGCTTTT ATACCCAGGA TGAGGTGCGC GATGTAGTCA AATACGCGCA GGAACGGTTT
GTGACGGTCA TTCCCGAAAT CGAAATGCCG GGCCACTCGG TAGCGGTTCT GGCGGCTTAT
CCAGAGTTAG GGAGCAACCC CGACAAGACG CTGACGCCCT CCCCAACATG GGGCGTTCAT
GACGATGTTC TGTTTCCGCG CGAAGAAACT TTCACCTTCC TGGAAAACGT CCTGACCGAA
GTGATGGCCC TGTTTCCGAG CCAGTACATC CATATCGGGG GCGACGAATG TCCGAAAACC
CAGTGGAAGC AAAGCAAGTT TTGCCAGGCC TTAATGAAGG AAAAAGGACT GAAAGACGAG
CACGAACTAC AAAGCTATTT TATCCAGCGT ATCGACAAGT TCGTTACCTC GAAAGGTCGC
CGGATTATTG GCTGGGACGA AATACTGGAA GGGGGCCTGT CGCCCAACGC GACCGTTATG
AGCTGGCGCG GCATCAACGG GGGTATTGCA GCCGCCCGGC AGAACCATGA CGTGATCATG
ACGCCAACCA CCTACTGCTA CCTCGACTAC TACCAGGCCG ACCCAAAAAC CCAGCCCGTG
ACCATCGGGG GCCTGCTCCC CATCGAAAAA GTGTACAGCT TCAACCCCTC GGTAACGGAC
AGCCTGACGG CTGAGCAGTC TAAACACGTG CTGGGCGTAC AGGCCAACGT TTGGTCGGAG
TACATGCCCA CGACTCACTC TGTGGAATAC ATGGCTTACC CGCGTCTCAT CGCCGTAGCC
GAAACCGGCT GGACACCCCA AGATGCCCGA AACATCGACG ATTTCAAACA ACGGCTGGAA
ACGCACAAAA AACGGCTTGA TTTTCTAAAG GTCAATTACT TCGGCGCACC CATCAACAAC
ACATTCCAAT ACGTGTGGCC GAAGGAAACG GCGAAGAAGT AA
 
Protein sequence
MKKSFLLLIA LCVTALATHA QTNTYALVPK PTHLEARSGS YTLPAKPTIA VQAADPEISR 
IARMLADQLA KATGSAPVVT TGKASGGISF VSAKGPKLGA EGYTLTVSPK QIVITAEQPK
GFFYGVQSLM QLMPAAVFSP TKVSGVAWTV PACTIEDQPR YGYRGSMLDV GRYFYPVAFI
KKYIDLLAVH KMNTFHWHLT EDQGWRIEIK KYPKLTEVSS IRPKTMVGHY RDKVYDNKPY
GGFYTQDEVR DVVKYAQERF VTVIPEIEMP GHSVAVLAAY PELGSNPDKT LTPSPTWGVH
DDVLFPREET FTFLENVLTE VMALFPSQYI HIGGDECPKT QWKQSKFCQA LMKEKGLKDE
HELQSYFIQR IDKFVTSKGR RIIGWDEILE GGLSPNATVM SWRGINGGIA AARQNHDVIM
TPTTYCYLDY YQADPKTQPV TIGGLLPIEK VYSFNPSVTD SLTAEQSKHV LGVQANVWSE
YMPTTHSVEY MAYPRLIAVA ETGWTPQDAR NIDDFKQRLE THKKRLDFLK VNYFGAPINN
TFQYVWPKET AKK