Gene Slin_1888 details

Gene Information

Locus tag: Slin_1888
Symbol:
ID: 8725625
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Spirosoma linguale DSM 74
Kingdom: Bacteria
Replicon accession: NC_013730
Strand:
Start bp: 2282765
End bp: 2284249
Gene Length: 1485 bp
Protein Length: 494 aa
Translation table: 11
GC content: 48%
IMG OID:
Product: peptidase S41
Protein accession: YP_003386732
Protein GI: 284036802
COG category:
COG ID:
TIGRFAM ID:
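
The start, end, and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes the coordinates are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length; the function names are hypothetical and not part of this record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(2282765, 2284249)
print(length_bp, protein_length(length_bp))
```

With the values in this record, the computed lengths agree with the listed Gene Length (1485 bp) and Protein Length (494 aa).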


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value: 0.389553
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAACAAC ACGTTGTGAA CAAAGCCCAA CGGGCAACTT TAAATCAGGT GATCGGGCAA 
CATAAATCCT TACTGGTCGT CGTGCTGCTT GGTTTTATTG GGTCCTTATC GGGCTGTAAG
AAAACGACCG ATGACGTTAC CCCGCAAACG ACGACGGCCA CGGTCAACGA AAACACTACG
GTCGATAGCT GGATACTGGC CAACATGAGA GATCTGTATT ACTGGAACGA TAAAATTCCG
GCTAATCCCG ATACGACCCT GGCTCCCGAT GTTTTCTTCG ATTCTATTCT GAACAAATAC
AACGCCACGA CGAATCCTAC CGGCGACCGT TTTTCGTGGA TAGAAAACGA TGCAAATACG
TTGACGGCCG AGTTAAGTGG CGAATCAACG ACAACCGGTA TGAATTTCAA TCTTTACCTG
CGGGCATCAG GCTCAACGGG GGTTATCGCC CAGGTGCTGT ACGTATCGCC CGGCTCTCCG
GCAGAAAAAG CGGGGTTGAA ACGGGGCGAC GTCATTACCA AAGTGAATGG CCAGTTACTC
AACACCACGA ATTATTCGGA TCTGTTATTT ACGGGCACGA CGTTTACGTA CGGGCTGGGT
ACAGTAAGCG GTAATTCAAT CGTTGATTCT GACCAGACCC GCAGCGTAAC GGCGATAGTG
TTTCAGGAAA ACCCCGTGTT TCTGGACTCG ATCTATACGG TTGGTTCTAA AAAAGTCGGA
TATCTGGTCT ACAATCAGTT TGTTCCCGGT GCGAATGGCA GCAAAGCCAA CGAATATGAT
GCGCAGGTCG ATGCCATATT CAGTAAATTC AAATCCCAGG GGGTCAATGA ACTGGTGCTG
GATTTACGGT ATAACCCGGG CGGCTATACG TCCTCGTCTG CCAATCTGGC CAGCCTGATC
GGAAAGGGTA TTAACTCCAG TAAACTTTAT TTCCGTGAAG AATGGAACAG CACCATTACT
CCTTATTTGC AGAAGGAGTA CGGCAGCAGC TTCTTTATTC AGAACTTCCT TGATAAACCC
CAAAACATAG GCAATAACCT ATCACGGGTA TTTGTTCTCA CAACGGATCA AACTGCTTCG
GCCAGTGAGT TAATTATCAA TGGTCTTCGC CCGTACATGA CCGTCACAAC GATTGGCACG
ACCACGTACG GCAAAAATGT GGGCTCGATC ACCGTTACCG ATGAGACTGG CAAAATTAAG
TGGGGAATGC AACCCATTGT GTTCAAATCG TACAACAATG CAGGCCAGTC TGACTACTCA
ACAGGGTTTA CGCCCAACAT TGAGGTCGAC GAAACGATGC CGCTGTTACC ACTGGGCGAT
ACGAACGAAA ACTTGCTGAA CGCAGCCCTG AATCAGATTT CGGGAAATGT TGCTGGCGGG
CGCCGGGCGG CTGTACGAAA TCCATTTATA CAGATGGGTT CATCAATTCA GCGGAAAGCC
GGTGGTCAAT CCATGATACG GGCAATAAAG AACCTGAAAT TATAA
 
Protein sequence
MKQHVVNKAQ RATLNQVIGQ HKSLLVVVLL GFIGSLSGCK KTTDDVTPQT TTATVNENTT 
VDSWILANMR DLYYWNDKIP ANPDTTLAPD VFFDSILNKY NATTNPTGDR FSWIENDANT
LTAELSGEST TTGMNFNLYL RASGSTGVIA QVLYVSPGSP AEKAGLKRGD VITKVNGQLL
NTTNYSDLLF TGTTFTYGLG TVSGNSIVDS DQTRSVTAIV FQENPVFLDS IYTVGSKKVG
YLVYNQFVPG ANGSKANEYD AQVDAIFSKF KSQGVNELVL DLRYNPGGYT SSSANLASLI
GKGINSSKLY FREEWNSTIT PYLQKEYGSS FFIQNFLDKP QNIGNNLSRV FVLTTDQTAS
ASELIINGLR PYMTVTTIGT TTYGKNVGSI TVTDETGKIK WGMQPIVFKS YNNAGQSDYS
TGFTPNIEVD ETMPLLPLGD TNENLLNAAL NQISGNVAGG RRAAVRNPFI QMGSSIQRKA
GGQSMIRAIK NLKL
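
The sequences above are displayed in space-separated groups of ten characters. A minimal sketch of how such a block might be joined into one continuous string and its GC fraction computed is shown below; the helper names are hypothetical, and the example uses only the first displayed line of the gene sequence rather than the full record.

```python
def join_sequence_blocks(text: str) -> str:
    """Remove spaces and line breaks from a grouped sequence listing."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First displayed line of the gene sequence, copied from the record.
first_line = "ATGAAACAAC ACGTTGTGAA CAAAGCCCAA CGGGCAACTT TAAATCAGGT GATCGGGCAA"
cleaned = join_sequence_blocks(first_line)
print(len(cleaned), round(gc_fraction(cleaned), 3))
```

Applying the same two helpers to the full gene sequence would give a value comparable to the GC content field in the Gene Information section.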