Gene Slin_3923 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_3923 
Symbol 
ID8727681 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp4705687 
End bp4707360 
Gene Length1674 bp 
Protein Length557 aa 
Translation table11 
GC content53% 
IMG OID 
Productcarboxyl-terminal protease 
Protein accessionYP_003388712 
Protein GI284038782 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.0829863 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value0.226226 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCTTCT CTAAACGACT CACCCTGTTA GCCTCCTCAG CCCTTGTAGC GGGTGGCATT 
GGTTTCTTCT CGTTCAAGAC GGACGACCGC TTCTTCGAGA TCGCGCGCAA CCTGGACATC
TACGCCACGC TGTTTAAGGA ACTCAATCTC TATTATGTAG ATGAGGTAAA CCCGAACCGC
ATGGTTAAAA CCAGCATCGA CGCTATGCTG AAAGCCCTCG ACCCGTACAC GAACTTCTTC
GCCGAAGACG AGATCGAGGA TTACATGACC ATGACCACCG GCCGCTACAA CGGCATTGGG
GCGCTCATTG GCCAGCGGCA GGGCAAAAGC ATCGTGCTAA TGGTGTATGA AGGCACCCCC
GCCGAAAAAT CGGGGTTGCA AATCGGTGAT GAAGTGCTTA AGGTGGACGG TGTAGACCTG
AAAACCCGCA AAGACCGCGA TGGCGGACCA CTTGATCCGG GTAAACTCCT GAAAGGGCAG
AACAACACGG CCGTAAAACT GACCGTTAGT CGGTACGGAC AAAAAGCCCC GCTCGAACTC
AGCGTTATCC GGGATGTGGT TAAAATGACC AACGTGCCTT ACTACGGCAT GGTATCGGAC
GAAGTGGGAT ACATCGACCT CAAAGATTTT ACGGCCACGG CTTCGCGTGA GGTACGGACC
GCCTATCAGG AACTGAAGGG GAAAGGGATG AAAAAACTTA TCCTCGACGT TCGTGAAAAT
CCGGGCGGAC TGCTCAACAT GGCCATCGAC ATCTCGAATA TTTTTATTCC GAAAGATTCA
GAAGTGGTGA CGACTAAAGG TAAAGTGACG GAGTGGAACA AGACCTACAC CGCCATGAAC
CCACCCCTCG ACCTCGACAT TCCTATTGTT GTGCTGACAA ACAGCCACAG TGCATCGGCG
GCCGAGATTG TATCGGGGGT TATTCAGGAT TACGACCGGG GCGTGTTGAT CGGGCAGCGG
ACCTACGGCA AAGGGCTGGT GCAGACCACT CGGGAATTGT CGTTCAACAC CAAGCTAAAA
ATCACAACGG CCAAGTATTA CATTCCGAGT GGCCGGTGCA TTCAGGCCAT CGACTACAGC
CACCGCAACG CCGATGGCAG CGTGGGCAAG ATTCCGGATT CGCTGAAAAC CGCTTTCAAA
ACCAAGGCGG GCCGGGTAGT ATACGACGGC GGTGGCGTGT TGCCCGATAT TGTCGTAGAA
GCGCAGACAC CCTCGCCGGT GGCCCTGAGC CTGACAAACA AAGGCCTGAT TTTCGATTAT
GCCGTGAAGT ACCGACACGA GCATGCTAGC ATTAAACCAG CCCGCGAATT CCGCCTGACC
GATGCCGAGT ATACTGAATT TGCGAAGTGG CTCGGCGATA AAGAGTACGA TTATACGACG
CAGGTCGAGA AAGACTTGGG TACCCTCGAA GCATCGGCCA AGAAAGAGAA GTATTTCGAC
CAGATTCAGG ATCAACTGAA GTCGCTGAAG ACCAAAATGT CGCACAGCAA AGATGCCGAC
CTGAACACCT TCAAGCCAGA GTTAAAAACC CTGCTTGAGC AGGAAATAGC CGGGCATTAC
TACCTGCAAA AAGGCATCAA GGAAGCCTCG TTCGCTACCG ATCCCGAAAT GAAAGCAGCC
CTTGACCTGT TCAAAGACAT GAACCGGTAC GGTACCATCC TGAAGGGAAA GTAA
 
Protein sequence
MRFSKRLTLL ASSALVAGGI GFFSFKTDDR FFEIARNLDI YATLFKELNL YYVDEVNPNR 
MVKTSIDAML KALDPYTNFF AEDEIEDYMT MTTGRYNGIG ALIGQRQGKS IVLMVYEGTP
AEKSGLQIGD EVLKVDGVDL KTRKDRDGGP LDPGKLLKGQ NNTAVKLTVS RYGQKAPLEL
SVIRDVVKMT NVPYYGMVSD EVGYIDLKDF TATASREVRT AYQELKGKGM KKLILDVREN
PGGLLNMAID ISNIFIPKDS EVVTTKGKVT EWNKTYTAMN PPLDLDIPIV VLTNSHSASA
AEIVSGVIQD YDRGVLIGQR TYGKGLVQTT RELSFNTKLK ITTAKYYIPS GRCIQAIDYS
HRNADGSVGK IPDSLKTAFK TKAGRVVYDG GGVLPDIVVE AQTPSPVALS LTNKGLIFDY
AVKYRHEHAS IKPAREFRLT DAEYTEFAKW LGDKEYDYTT QVEKDLGTLE ASAKKEKYFD
QIQDQLKSLK TKMSHSKDAD LNTFKPELKT LLEQEIAGHY YLQKGIKEAS FATDPEMKAA
LDLFKDMNRY GTILKGK