Gene Slin_1127 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_1127 
Symbol 
ID8724860 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp1376908 
End bp1378347 
Gene Length1440 bp 
Protein Length479 aa 
Translation table11 
GC content51% 
IMG OID 
ProductAlpha-L-fucosidase 
Protein accessionYP_003385977 
Protein GI284036047 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value0.95278 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAAAC AAATTACCTC CATTCTCTTC AGCTTACTGG TTTCGACATC GGCTCTGGCG 
CAACAGCACT CGGAGCAGAA TCACGATAAA TACGTCTGGC CCAAAGATGA ACTGGTAAAG
AAAAAACTGG CGAACTGGCA GGATATTAAG TTTGGCCTGC TCATGCACTG GGGCACTTAC
AGCGAGTGGG GCGTGGTGGA ATCGTGGTCA CTGTGCCCCG AAGATGAAGG CTGGTGCGAA
CGCAGAGGCC CCTATGCGGC CAATTGGTTC GAGTATAAAA AAGCGTACGA AAATCTGCAA
ACAACTTTCA ACCCTACCAA ATTCAATCCC GAACGCTGGG CCAATGCAGC AAAAGGGGCG
GGCATGAAAT ACGTGGTTTT CACCACCAAA CACCACGACG GCTTTTGCAT GTTCGACACC
AAGCTGACGG ACTATAAGAT CACTGATAAG AAAACCCCGT TCTCATCGAA TCCGCGCAGT
AACGTCACAA AAGAGATTCT TGGTGCTTTT CGTCAGCAGG GCTTTATGGT CGGCACGTAT
TTTTCCAAAC CCGATTGGCA TACCGAAGAT TACTGGTGGA CGTACTTTCC GCCCAAAGAT
CGTAACGTAA GCTACGACCC GAAAAAATAC CCCGACCACT GGAAGAAGTT CAGCGATTTC
ACCTATAACC AGATCGAAGA ACTAATGACC GGTTACGGCA ACGTCGATAT TCTGTGGCTC
GACGGCGGCT GGGTTCGTCC GGCCAGCACC ATCGACTCAA CCGTTAGTTG GCAGCGCACG
ATCCCATACA GCCAGGATAT AAACATGGCA CGCATTGCCG GTATGGCGCG CCAGCATCAG
CCGGGCCTTC TGGTTGTTGA CCGGACGGTA TCGGGCGAAT TCGAGAATTA CGTCACCCCG
GAACAGTCTA TCCCCGACCA TTACATGCCC ATTCCCTGGG AAAGCTGCAT CACTATGGGC
GATAGTTGGT CGTACATCCC TAAGGAGAAT TTCAAACCCG CCCGTAAACT GGTTCAGACA
TTGGTTGATA TCGTAGCCAA AAACGGGAAT CTCTTACTTA ACATTGCCCC CGGCCCCGAT
GGCGAATGGC ACGAAGAAGC CTATCAGCGA TTGCAGGAAA TTGGCAAATG GATAACCGTA
AACGGCGAGT CGATTTACGG CACCAAACCG TTGGCTCCCT ACCGGCAAGG GCAATGGGCG
TTCACCTCGA ACAATAAGGC TGTTTATGCC TCCTACCTGC CCAGCGAGTC GGAACAGCAG
CTTCCGGCCA GCATTTCATT ACCGGCGCTT ACCGTTGCGC CCAATGCTAA AGTGACCGTA
CTGGGTGCAT CACAGGCCTT GAAGCTGACC AAAACAAAGG ATGGTTTCAG CGTTGTCGTT
CCCGAGAAGG TACGTCAGCA ATTGGCTGGT CAACCCGTTT GGGTATTTAA AATTGGCTAA
 
Protein sequence
MKKQITSILF SLLVSTSALA QQHSEQNHDK YVWPKDELVK KKLANWQDIK FGLLMHWGTY 
SEWGVVESWS LCPEDEGWCE RRGPYAANWF EYKKAYENLQ TTFNPTKFNP ERWANAAKGA
GMKYVVFTTK HHDGFCMFDT KLTDYKITDK KTPFSSNPRS NVTKEILGAF RQQGFMVGTY
FSKPDWHTED YWWTYFPPKD RNVSYDPKKY PDHWKKFSDF TYNQIEELMT GYGNVDILWL
DGGWVRPAST IDSTVSWQRT IPYSQDINMA RIAGMARQHQ PGLLVVDRTV SGEFENYVTP
EQSIPDHYMP IPWESCITMG DSWSYIPKEN FKPARKLVQT LVDIVAKNGN LLLNIAPGPD
GEWHEEAYQR LQEIGKWITV NGESIYGTKP LAPYRQGQWA FTSNNKAVYA SYLPSESEQQ
LPASISLPAL TVAPNAKVTV LGASQALKLT KTKDGFSVVV PEKVRQQLAG QPVWVFKIG