Gene Slin_4161 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_4161 
Symbol 
ID8727920 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp5009296 
End bp5010651 
Gene Length1356 bp 
Protein Length451 aa 
Translation table11 
GC content58% 
IMG OID 
Productglycoside hydrolase family 1 
Protein accessionYP_003388947 
Protein GI284039017 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.167873 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCAATC AAAAGTCAAT TTGGGAAACG CTGAATAAAC CTTTACAAAA CAAATTAGCC 
TTATGGGGAG GTCTGGAATG TACCGTTAAC CGGGTAGGTG ACGTGTATCA GGATCAGATC
GTACGCAGTG GGCACCATGA TCGGCTAAGC GATCTGGATT TAATCGCCGA CTTAGGTATC
CGTGCCCTGC GCTACCCCAT TTTGTGGGAA CGTACAGCAC CCGATCACCC CGATCAACCG
GACTGGTCGT GGCCCGATGA ACGGCTCAAC CGGCTGCGTC AGTTAGATAT CCGGCCCATT
GTCGGGCTGG TGCATCACGG CTGCGGACCC CGCTATGCCA CGTACGACAC ACCGGCTTTT
GAAGACGGGC TGGCCCGCTT TGCCCGTCAG GTGGCCGAAC GGTACCCCTG GATCGATGCT
TATACGCCCA TCAACGAACC GCTGACAACC GCCCGTTTCG GAGGCTTGTA CGGACTCTGG
TATCCGCACG GCCGGTCTAA TCAGCTATTC GTCGATCTGT TGCTGCGCGA GTGCCGGGCT
ACCATCCGGG TTATGGCCGA AATCAGGGCC GTACAGCCCA ATGCCCAACT TATTCAGACC
GATGATCTGG GAAAAACTCA CAGCACCCCG GTTCTCAGCT ACCAGGCCGA ACTGGAGAAC
GAACGGCGGT GGCTGGGCTG GGATTTGCTC TGCGGACATG TAACGCCCCA CCACCCATTG
TGGGCGTACT TGCGGGAGTC GGGAGCATCC GAAGCCGATT TGTGGTATCT CATCGAGCAT
GCCTGCCCGC CATCGGTTAT AGGCGTTAAC CACTACGTCA CCAGCGAGCG GTATATAGAT
CACCGTATTC ATCACTATCC AACTCATTTA CACGGCGGCA ATGGTTATCA TCGATACGCC
GATACGGAGG TGGTACGGGC CGCCCCGGAG CAGCGCACCG GCTTAGCTAC CTTGCTACAG
GAAGCGTGGC AGCGGTATGC ACTACCCATC GTTGTGACAG AGGCCCATCT GGGCGATCGG
CCCGAAGAGC AGATGCGCTG GCTGGGCGAA CTCTGGCAAC AGGCGCAACA GGCTCAGGAT
GCCGGTGCCG ATGTCCGGGC CGTGTCGGTC TGGGCCATCA TGGGCCTGTA TGACTGGCAT
TGTCTCCTGA CCCGGCGGGA AGATCGACAC GAGCCGGGGG TATTCAACGT CAGCAGCGGC
ATCCCGGAAC CCACTGAGCT GGCAACCATG CTGAAACGCC TGACCGCTGG TGAACCGATA
GGGTCGTTAG TGCCGCCCGG ACCGGGTTGG TGGCAGACGC CCGACGCCCA AATCTACCAT
GCCTCGGAGC CTTTAGTCAG TAATTCAGTC GAGTAA
 
Protein sequence
MSNQKSIWET LNKPLQNKLA LWGGLECTVN RVGDVYQDQI VRSGHHDRLS DLDLIADLGI 
RALRYPILWE RTAPDHPDQP DWSWPDERLN RLRQLDIRPI VGLVHHGCGP RYATYDTPAF
EDGLARFARQ VAERYPWIDA YTPINEPLTT ARFGGLYGLW YPHGRSNQLF VDLLLRECRA
TIRVMAEIRA VQPNAQLIQT DDLGKTHSTP VLSYQAELEN ERRWLGWDLL CGHVTPHHPL
WAYLRESGAS EADLWYLIEH ACPPSVIGVN HYVTSERYID HRIHHYPTHL HGGNGYHRYA
DTEVVRAAPE QRTGLATLLQ EAWQRYALPI VVTEAHLGDR PEEQMRWLGE LWQQAQQAQD
AGADVRAVSV WAIMGLYDWH CLLTRREDRH EPGVFNVSSG IPEPTELATM LKRLTAGEPI
GSLVPPGPGW WQTPDAQIYH ASEPLVSNSV E