Gene Slin_5254 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_5254 
Symbol 
ID8729020 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp6406134 
End bp6407645 
Gene Length1512 bp 
Protein Length503 aa 
Translation table11 
GC content54% 
IMG OID 
Productpeptidase S41 
Protein accessionYP_003390024 
Protein GI284040094 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000000438932 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones38 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGATTG GTAAAACAGT TGCATTGATA GGCTTATTTG TCGGCTTAAG TTATCAGATT 
CTATCAGCCC AGGTGCTCAC GCCGGCACAG GCTCGTACCG ACCTCAGTTA TTTAAAGCGA
AAGCTCGATC TGCTCCATCC CGGCATGGGC TACTACACGC CACAGCCCCG GATGGAGCAA
TTGTATGATT CGTTATACAA CCGGCTAACG GCGCCCTACG ACTACATGGC ATTTTTCCAC
CACGTCAGTC CGCTGGTAGC CGCCCTGAAA GACGGCCACA CCAACCTGAA TCACCGGAAA
AATTACATCG GTAAGTCCAC GCGTTTTATT CCGTTCTACA TCCGGCTGGT CGATTCACAA
TATTATATAA GCCACAACGT ATCGGCCGAT AGCAGCCTGC AACGCGGCAC CGAATTGCTA
TCCATCAATG GGAAACCGGT AGCCGATGTC CACCGCGAAC TCATGAATAC GGATCACTCC
GGCTCCGATG GCGACAACCT CACCGGACGG CGGCAGTGGA GTATGGTTCA GTTTGCCGAT
TATTACGCAG CCTGGTTCGG CTCGGCCGAT TCGGTTACGA TCACCTACCG GCTCACCGGC
GATACGCTCA TTCGGCAAAC GCGGGTGCGG TGTCTGAGTC TGGCCAGTTT CCGGTCTACC
ATCCAGCGCC GGTACGGAAC CGAGCTTGAT CATCGCCCTA ACCTATCCGT TCGAATCGTC
GATACGCTGA CCCGAACGGC TGTACTGCGG GTATCGTCAT TTATGGGATT TAAAAAGAAT
GACCCGTTTC AATGGGCGTA TAACCGGCGG CTGAAACGGG CGTTCAAAAC GCTGCGTGAG
CAGAATATTC AAAATTTAAT TGTCGATATG CAGGGCAATG GTGGCGGTAT TGTGGTGAAT
TCGGCCCGGC TGCTTCGGTA CTGGATGCCG AAACCGTTCC GGATTATGGA TCATGAAGAA
ATGAAACAGG CCGCCCGCGC CGAACTGGTT ACGCGCTGGA ATCCTTTCTC GGCCCTGAAT
TTCAGTCTGC AATACCGAAC CGATAAATCG GGGGGATTTG CCAGCCGATC CGGTAACCGC
CGATACCGCC CACGGCACCG GGATGCGTTT CGGGGCAATA TTTACTTCAT GCAAAACGGC
GCGTCGTTCT CGGCGACGAC AACGGTACTG GCCAAAACCC TCGATGCCGG TATGGGTACT
TTCGTTGGCG AAGCGAGCGG CAGTGCGTAT TGGGGCGACT TTGCCGGGCA TTTCAAAACG
GTAACCCTAC CCAATTCCCG GCTTCAGGTA CGGATTCCGC TCAAAAAACT GACGCACGCG
GTTAACACCG AGCGGGCCAA CGGCTTTACC GTCGAACCCG ATTTTATCGT CACCCGCAGT
TTCGACGACC TGATGGTGAA CCGCGACTAT ATTTTAGAGT ACACATTACG GCTTATCCGG
GAGGGTGTTG TGGTTCGGCC GGGGCCAGAG AAACAGCCGA TTCGTAACCG GTCGTTGCAG
GCTTCGCGGT AA
 
Protein sequence
MKIGKTVALI GLFVGLSYQI LSAQVLTPAQ ARTDLSYLKR KLDLLHPGMG YYTPQPRMEQ 
LYDSLYNRLT APYDYMAFFH HVSPLVAALK DGHTNLNHRK NYIGKSTRFI PFYIRLVDSQ
YYISHNVSAD SSLQRGTELL SINGKPVADV HRELMNTDHS GSDGDNLTGR RQWSMVQFAD
YYAAWFGSAD SVTITYRLTG DTLIRQTRVR CLSLASFRST IQRRYGTELD HRPNLSVRIV
DTLTRTAVLR VSSFMGFKKN DPFQWAYNRR LKRAFKTLRE QNIQNLIVDM QGNGGGIVVN
SARLLRYWMP KPFRIMDHEE MKQAARAELV TRWNPFSALN FSLQYRTDKS GGFASRSGNR
RYRPRHRDAF RGNIYFMQNG ASFSATTTVL AKTLDAGMGT FVGEASGSAY WGDFAGHFKT
VTLPNSRLQV RIPLKKLTHA VNTERANGFT VEPDFIVTRS FDDLMVNRDY ILEYTLRLIR
EGVVVRPGPE KQPIRNRSLQ ASR