Gene Slin_4788 details

Gene Information

Locus tag: Slin_4788
Symbol:
ID: 8728552
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Spirosoma linguale DSM 74
Kingdom: Bacteria
Replicon accession: NC_013730
Strand:
Start bp: 5836436
End bp: 5838124
Gene Length: 1689 bp
Protein Length: 562 aa
Translation table: 11
GC content: 56%
IMG OID:
Product: Xylan 1,4-beta-xylosidase
Protein accession: YP_003389565
Protein GI: 284039635
COG category:
COG ID:
TIGRFAM ID:
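
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length counts the stop codon while the protein length does not; neither convention is stated in the record, and the function names are illustrative only.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for a CDS, assuming the final codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values copied from the record for Slin_4788.
gene_length = span_length(5836436, 5838124)
print(gene_length)                           # 1689
print(expected_protein_length(gene_length))  # 562
```

Under these assumptions both results agree with the Gene Length and Protein Length fields of the record.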


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.771274
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 44
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTCTAC GCGTAACCCT TGGTTTACTG TTTTTAACGG TCCTCCCAGC CGTTGCCCAA 
TCAGCAAAAA TGACTAGCCT AAAATCCGTT TCCGGCACGG CACGCTCAAC AATATCTTCC
GTCTGGGTAC CTGATTTGGG AAATGGTTCG TACAAAAACC CCGTCCTGAA CGCCGACTAC
TCCGATCCGG ACGTATGCCG GGCAGGCAAC GACTTTTATC TGGTCGCTTC CAGCTTCGAC
GCCATTCCCG GCCTGCCCAT TCTGCACTCG ACGGATCTGG TCAACTGGAC CCTCATCGGG
CACGCCCTGA AACGCCAGCC TCCGTTCGAC CATTTCGAGA AAACCCAGCA CGGCAATGGG
GTATGGGCAC CGGCCATCCG CTACCACAAC AGCGAGTTCT ACATCTACTA TCCCGACCCC
GATTTTGGCA TTTATCTGAC CAAAGCAACC AATCCGGCTG GCCCCTGGTC GGAGCCGGTG
CTGGTGGAAG GCGGCAAAGG GCTGATCGAC CCCTGCCCGC TCTGGGACAA TGACGGGCAG
GTTTATCTGG CACACGGCTG GGCCGGAAGT AGGGCGGGTA TCAAAAGTAT TCTGACCATC
AAAAAGCTCA ATGCGGCTGG CTCTAAAGTG ACGGATGAAG GCGTTATCGT GTACGACGGT
CACGAAACAG ACCCCACCAT CGAAGGGCCG AAACTCTATA AACGAAACGG CTATTACTAC
ATTTTTGCCC CGGCGGGGGG CGTCTCAACG GGCTGGCAAC TGGCGTTACG ATCCAAAAAT
ATCTACGGCC CTTACGAGCG AAAAGTGGTG ATGGATCAGG GCACAACGTC AATCAACGGA
CCGCATCAGG GCGCGTGGGT AACGGCAAAC GCGGGCAAGG GGAAAGCCGA TGAAGACTGG
TTTCTGCATT TTCAGGATAA GGCCGCCTAC GGTCGGGTGG TGCACCTGCA ACCTATGAAA
TGGGTGAACG ACTGGCCCGT CATTGGCATG GATGCGGACG GCGACGGTAA AGGCGAACCC
GTTTTGACCT ATAAAAAGCC AGCTAATGGG AAAGCATCGC CCATAGCCAC CCCGCCAGAG
TCAGATGAAT TCAACAGCCT GAAACTGGGA GTGCAGTGGC AATGGCAGGC GAATCCAAAA
GGTATCTGGC ACACAACCAG CAACCAGGGC TTTTTGCGGC TCTACTCAGC CAAAGCACCA
GACGAAGCCC GCAACCTGTG GGACGTGCCG AATGTGCTGA TGCAGAAGTT TCCGGCGGAA
ACCTTCATGG CTACTACAAA ACTGACCTTT ACGCCCAACT CGAAGCTGGA AAATGAAAAA
ACGGGGCTGG TCGTAATGGG GCTGAGCTAT GTCGGCCTGA CGCTGAAAAG CGCCAAAGAG
GGTATCCGGC TGGTGTACAG CGTTTGTAAA AAGGCCGCCG ACGGGAAGCC GGAAGACGAA
ACGACCATCA CCCGAGTCCC AGCCAACACG CCGATTTACC TGCGGGTTCA GGTTACGAAG
GGAGCCAAAT GCCAGTTCAG CTACAGCCTG GATGGGCAGA CGTTCACTAA AACCGGCGAC
GAGTTTCAGG CCGAAGTGGG CCGCTGGATT GGGGCAAAAA TGGGGATTTT CTGCACCCGG
ACCACCCAGA TCAACGATTC AGGCTACGCC GATTTCGACT GGTTCCGGGT GGAGCCGAAT
GCGGACTGA
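
The gene sequence is printed in blocks of 10 bases, so it has to be joined into one string before any computation. The following is a minimal sketch of that clean-up step together with a GC percentage calculation, which could be compared against the GC content field of the record. The helper names are hypothetical, and how the database rounds its reported value is not known.

```python
def clean_sequence(block: str) -> str:
    """Join a block-formatted sequence into one uppercase string."""
    # str.split() with no arguments drops spaces and line breaks alike.
    return "".join(block.split()).upper()


def gc_percent(block: str) -> float:
    """Percentage of G and C bases in a (possibly block-formatted) sequence."""
    seq = clean_sequence(block)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First printed line of the gene sequence, used here only as sample input.
first_line = "ATGACTCTAC GCGTAACCCT TGGTTTACTG TTTTTAACGG TCCTCCCAGC CGTTGCCCAA"
print(len(clean_sequence(first_line)))
print(round(gc_percent(first_line), 1))
```

Passing the full multi-line sequence as a single string works the same way, since all whitespace is removed before counting.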
 
Protein sequence
MTLRVTLGLL FLTVLPAVAQ SAKMTSLKSV SGTARSTISS VWVPDLGNGS YKNPVLNADY 
SDPDVCRAGN DFYLVASSFD AIPGLPILHS TDLVNWTLIG HALKRQPPFD HFEKTQHGNG
VWAPAIRYHN SEFYIYYPDP DFGIYLTKAT NPAGPWSEPV LVEGGKGLID PCPLWDNDGQ
VYLAHGWAGS RAGIKSILTI KKLNAAGSKV TDEGVIVYDG HETDPTIEGP KLYKRNGYYY
IFAPAGGVST GWQLALRSKN IYGPYERKVV MDQGTTSING PHQGAWVTAN AGKGKADEDW
FLHFQDKAAY GRVVHLQPMK WVNDWPVIGM DADGDGKGEP VLTYKKPANG KASPIATPPE
SDEFNSLKLG VQWQWQANPK GIWHTTSNQG FLRLYSAKAP DEARNLWDVP NVLMQKFPAE
TFMATTKLTF TPNSKLENEK TGLVVMGLSY VGLTLKSAKE GIRLVYSVCK KAADGKPEDE
TTITRVPANT PIYLRVQVTK GAKCQFSYSL DGQTFTKTGD EFQAEVGRWI GAKMGIFCTR
TTQINDSGYA DFDWFRVEPN AD
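
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below assumes the codon assignments of the standard genetic code, which translation table 11 shares (table 11 differs in the set of permitted start codons, not in amino acid assignments). It stops at the first stop codon and does not handle ambiguous bases; the names used are illustrative.

```python
from itertools import product

# Codons in NCBI order (T, C, A, G at each position; third base varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon until the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop block spacing and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# Start and end of the gene sequence shown in the record.
print(translate("ATGACTCTAC GCGTAACC"))  # MTLRVT
print(translate("GCGGACTGA"))            # AD
```

The two outputs match the first six and last two residues of the protein sequence listed in the record.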