Gene Slin_1642 details

Gene Information

Locus tag: Slin_1642
Symbol:
ID: 8725377
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Spirosoma linguale DSM 74
Kingdom: Bacteria
Replicon accession: NC_013730
Strand:
Start bp: 1975065
End bp: 1977041
Gene Length: 1977 bp
Protein Length: 658 aa
Translation table: 11
GC content: 53%
IMG OID:
Product: type 3a cellulose-binding domain protein
Protein accession: YP_003386488
Protein GI: 284036558
COG category:
COG ID:
TIGRFAM ID:
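
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length; the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 1975065
end_bp = 1977041

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of three bases.
codons, remainder = divmod(gene_length, 3)

# Assumption: the last codon is a stop codon and encodes no residue.
protein_length = codons - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the result agrees with the reported Gene Length (1977 bp) and Protein Length (658 aa).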


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.117524
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 35
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCATCAT TTTTACGCTC ACTGAGCCTC GCATGGCTCT GGCTTACGGC CGTCTTCCTG 
TTGCACGGAG CTATCGCGCA ACCTGTCCAG CAAGGTGACC CAACACCGCA AATCACCGTC
ACGCCTTCAT CACTGACCAT TCTGAATTAC AGTGAGTATG AAGGTCAGTT TCCGGCCAGT
TACACTGTTT CGGCCAGCGG GTTAACAAAT GATCTGGTCA TTACTGCTCC GCCTTACTTC
CTGCTCAGCG CCCGGGGAAA TACAGGGCCG AGCCTTTCCC TGCCGATTGT CAATGGGGAA
GTATCACCAA CACAGGTTTC GGTAATACTG CTCGCGTCGT CACCCGGTAC GTTTACCGGT
GTCGTTACCA ATGTCAGTGG CTCCGCAACA GCGACTGTGG CGGTGAGTGG AACCGCCATT
TCGGAGTCGG TGAGTGTAAG CCCCAGTACT CTGAATTCGT TTACGACAAC GGCCGGGCAG
CCTTCAGCTG TTCAATCCTA TACCGTTACC AGTCGTGGAG GTCTGTCTGT TGTTGTCAAT
GCTCCAGCCG GATTCGAGAT TCGTACGGGC AGCGCAGCGT TTGGCTCATC GCTGGTGATT
GGCCCGAGTT TGTCGTATAA GAATACACAG GTCGATGTAC GCTTGGTTGG AACAACGCCC
GGAACTGTTT CGGGGGTTAT AGCCAATGAT ACCTACTACC ACTCGGCTCA CCTTACGTAT
CCGGTAGCAG TGAGTGGCGT GGTTACACCG GTTACTGCAT CTGCTTCACT GAGCGTGCTG
CACCGCGATG CTGATTATGG CAATCGAACG GATCAGCTTA TCCGACCTTA CCTTGAGCTT
GGTAATGAAG GCACTACGGC CATTCCGTAT AGCCAGATTA CCCTGCGGTA TTGGTTTACC
TCCGAGGGGG GCTCACCACC TACCGATTTG CAGGTGTACT ACGCACAGAT GGGAACCCGT
TACGTCAGGA TGAAGTATGT GCCGCTTGCA GAGCCGCGCC AGGGGGCATT CGGTTATGTT
GAGTACAGTT TCGACGCATC GGCGGGGAGC TTAGCCGCCG GGAGTCGGTC GGGTCCGATT
GAGAACGGTA TCCTGAAGCA GGATCGGTCA GCCTTCAACG AGTCTGACGA TTATTCGTAT
GCTACTCCAA CTACGTTTAC GCGTAATACG CATGTAACGG CCTACCTGAA TGGGCGGCTT
ATCTGGGGGG AAGAACCCGC CCCTGCACTG GTATTGCGGC AGGTTAAGGT TTATTCGGCC
GCAAAAAACA GTGATATCAC CAGCAGTATC AGTACCGTTC TTGAGGTGCG TAATACAGGG
AATGTGGCAA TTCCTTTGCA GGATTTGACG GTACGCTATT GGTTTACGTC CGAAACGAGC
CAGCTGCTCA ATAGCTATAT TGATTATGCA CAAATAGGTG CTCAGACCAT TAAGCACAAC
GTTGTTCGAC TGGCACAGCC AGTGTCGGGT GCCGATAGCT ACCTTGAACT GAGCTTTTCG
GCTGGAGCCG CTGGCCTGGC ACCGCTGAGT AGTACAGGGC AAATTCTGTT TCGGCTGGTA
AAGCCTGACT TCTCGTTGCT GAACCAGGTA AATGATTATT CTCACGGTCC TGTAAACCTG
ACCGAAAACC CCCACATAAC CGTCTATCTA CAGGGAAATC TGATTTATGG TACCGAGCCG
CCGGGTGGCA TGGGACGTAT GGGTGTACCG GATGAAAACA AGTTATTACA GGTCACGCTA
TTGGGCAATC CGGTACAGAA TGAACAGTTG ATTCTGGAAG CGCGTGGTGC CCAGGGTCTT
CCTTTGGTTC TGCAACTGGT TGACCGTCAG GGCGTACAGG TATTCGGGAA GGAGGTAAGC
GAGGCCGCCG ACGTGGAGCG GCAGCAGCTG GAAATGAGTC GGCAACCGGC GGGGGTATAC
CTATTACGCA TACGTACACC GAATCAGGAG CGGGTACTTA AAGTAATCAA GCCATAA
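
The gene sequence is printed in space-separated blocks of ten bases, so it needs cleaning before any computation. The following is a minimal sketch of how the blocks could be joined and a GC fraction computed, for comparison with the reported GC content; `clean_sequence` and `gc_fraction` are hypothetical helper names, not part of any tool referenced by this record.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C (0.0 for an empty string)."""
    if not seq:
        return 0.0
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence, copied from the record above.
first_line = "ATGTCATCAT TTTTACGCTC ACTGAGCCTC GCATGGCTCT GGCTTACGGC CGTCTTCCTG"
cleaned = clean_sequence(first_line)

print(len(cleaned))            # number of bases on that line
print(gc_fraction(cleaned))    # GC fraction of that line only
```

To check the whole gene, paste all sequence lines into one string and pass it through the same two functions.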
 
Protein sequence
MSSFLRSLSL AWLWLTAVFL LHGAIAQPVQ QGDPTPQITV TPSSLTILNY SEYEGQFPAS 
YTVSASGLTN DLVITAPPYF LLSARGNTGP SLSLPIVNGE VSPTQVSVIL LASSPGTFTG
VVTNVSGSAT ATVAVSGTAI SESVSVSPST LNSFTTTAGQ PSAVQSYTVT SRGGLSVVVN
APAGFEIRTG SAAFGSSLVI GPSLSYKNTQ VDVRLVGTTP GTVSGVIAND TYYHSAHLTY
PVAVSGVVTP VTASASLSVL HRDADYGNRT DQLIRPYLEL GNEGTTAIPY SQITLRYWFT
SEGGSPPTDL QVYYAQMGTR YVRMKYVPLA EPRQGAFGYV EYSFDASAGS LAAGSRSGPI
ENGILKQDRS AFNESDDYSY ATPTTFTRNT HVTAYLNGRL IWGEEPAPAL VLRQVKVYSA
AKNSDITSSI STVLEVRNTG NVAIPLQDLT VRYWFTSETS QLLNSYIDYA QIGAQTIKHN
VVRLAQPVSG ADSYLELSFS AGAAGLAPLS STGQILFRLV KPDFSLLNQV NDYSHGPVNL
TENPHITVYL QGNLIYGTEP PGGMGRMGVP DENKLLQVTL LGNPVQNEQL ILEARGAQGL
PLVLQLVDRQ GVQVFGKEVS EAADVERQQL EMSRQPAGVY LLRIRTPNQE RVLKVIKP