Gene Slin_6521 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_6521 
Symbol 
ID8730307 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp7911805 
End bp7913934 
Gene Length2130 bp 
Protein Length709 aa 
Translation table11 
GC content51% 
IMG OID 
Productcarboxyl-terminal protease 
Protein accessionYP_003391277 
Protein GI284041347 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAAGT ATCTGGTGAC CCTAGTGCCC GTGCTGATGC TGAGTTTTCA GCCGGATTCG 
CCCTCAAATG GTGCCCCCAA AACACATGGG CAAACTATTT CTGCAGGCGT TGTGAATGGT
GATAAAAACG GAGGACCCGT TGAGGAACTG AAGCCATCGA TCTCGCAGGA GAAAGTGGAA
ACGCTGGTAG CGAAGCTGCT AACGACTTAC CACTACCGTA AGGTAAAACT CAACGACTCC
CTCTCATCGG TGGTTTGGGA CAACTATCTG AAAGAGCTGG ATGGCAACAA AACCTATTTT
CTGGCGTCGG ACGTAGCGTC TTTTGAGAAG TACCGTTATC AGATTGATGA TGCCCTCATT
AACGGCGACC TGACGGCAGC TTATGATTTG TTCAACCTGT ACCGGAAACG GTATCAGGAG
CGAAGTGAGT TTGTTAAAGC GCAAATCAAA AAGCCGTTTA CCTTTACCAG CGACGAAAAC
TTCAACACCG ACCGGGAGAA AATGCCCTGG CCTAAAACGG TGGAAGAGCA GAATGACCTG
TGGACCAAGA TCCTGAAAAA CCAGGCGCTC GAACTCAAGT TAGGCAACCG GAAAGATAGT
GCCGTGGCCG CCCTGATGAC CCAGCGGTAC AACAACCTCG ATAAGGCTAT CAATCGGGTG
AAAAGCGCGG ATGTATTCCA GATGTACATG AACTCGTTTG CCGAAGCACT CGACCCGCAC
ACCAACTACC TGTCGCCAAG CTCTGCCGAT AAGTTCAATC AGGACATGAG TCAGTCACTG
GAAGGTATCG GTGCGATTTT ACGCGAAGAT GGCGATTACA TCCGTATCAT GGACGTTTTG
CCCGGTGGCC CGGCGTTCAA GAGCAAACTG ATCAACAAGG ACGATAAAAT AGCCGGTGTT
GCTCAGGGTG ATAATGGCCC GATGGTCAAC ACCATGAACT GGCAGGTCGA CGAAGTGGTG
AAGCTCATCA AAGGGCCAAA AGGAACGATT GTTCGCTTAC AGGTCATCTC GCCAAACTCG
CTGGCGGGCG CTCCACCAAA GGAGATACGG CTGGTTCGGG AAAAGATAAA ACTGGAAGAA
CAACGCGCTA AGAAAGAAAT CATTGAAGTG ACGGATAACG GCAAACCGTT CAAGATTGGC
GTTATCAACA TTCCTATATT CTACCGTGAT TTTGAAGGGG CCCGCAAGCG CGAGGAAGGC
TTTAGCAGCA CGACGAGCGA CGTAAAGAAA TTTGTAGAAG AGCTCAAAGC CGAGAAAGTT
GACGGTATCG TCATTGACCT TCGCGACAAT GGCGGTGGCT CACTGGTGGA GTCCATTAAC
CTGACTGGCC TGTTTATTCC GAAAGGCCCC GTTGTACAGG TGCGCGAAGC AACCGGCGAA
ACGGAAGTGT ATACCGACCC CGACCCGTCC GTTACCTACG ATGGCCCGAT GGCCGTTCTG
GTAAACCGTT TTAGTGCGTC GGCGTCAGAG ATTTTCGCGG CTGCTATTCA GGATTACAAG
CGGGGTATTA TTGTGGGTGG ACAAACTTAT GGCAAAGGCA CGGTACAAAC GATGATCGAC
CTGAACCAAT GGCTGCCTAA AGAGCCCGAA AAAGTTGGTC AGGTAAAAAT GACCATCCAG
AAGTTCTACC GCATTAATGG CAGCAGCACA CAGCACAAAG GGGTAACACC CGACGTTCAA
CTGCCATCGG CATTCTCGGC TGAAGAGTAT GGCGAAAGTT CACAGCCAAG TGCGCTGCCC
TGGGATCAAA TAAATTCGAC CCGATACGAG CAGTCGCGCG GCATTGATGA CAAAATCCTG
AGCCGCCTTC GTGACCGTTT CGATCAGCGC CTGAAGTCGG ACCCTGAACT GAAGCAACTA
GCCCAGGATT TGGCCGACTT CAAGAAAGCA AAAGAAAATA CCGTTGTCTC GCTACAGGAG
GCCAAGCGCC GGAAAGAGCG AGATGAAGCG GAACGTAAGC GCACGGCTGC TAATAAGGTA
TCCCAGATAT CAGCCTCGGG GGATGAAGCT GAACCAGCCA CCACCGGGAC AGCAGCAACT
CCCAAGAAGA AAAAAGACCT GTATCTGAAC GAAGCAGGTT TGGTATTGGC AGATTACATC
ATGGCCACCC ATCTGGCCGT GAACAAATAA
 
Protein sequence
MKKYLVTLVP VLMLSFQPDS PSNGAPKTHG QTISAGVVNG DKNGGPVEEL KPSISQEKVE 
TLVAKLLTTY HYRKVKLNDS LSSVVWDNYL KELDGNKTYF LASDVASFEK YRYQIDDALI
NGDLTAAYDL FNLYRKRYQE RSEFVKAQIK KPFTFTSDEN FNTDREKMPW PKTVEEQNDL
WTKILKNQAL ELKLGNRKDS AVAALMTQRY NNLDKAINRV KSADVFQMYM NSFAEALDPH
TNYLSPSSAD KFNQDMSQSL EGIGAILRED GDYIRIMDVL PGGPAFKSKL INKDDKIAGV
AQGDNGPMVN TMNWQVDEVV KLIKGPKGTI VRLQVISPNS LAGAPPKEIR LVREKIKLEE
QRAKKEIIEV TDNGKPFKIG VINIPIFYRD FEGARKREEG FSSTTSDVKK FVEELKAEKV
DGIVIDLRDN GGGSLVESIN LTGLFIPKGP VVQVREATGE TEVYTDPDPS VTYDGPMAVL
VNRFSASASE IFAAAIQDYK RGIIVGGQTY GKGTVQTMID LNQWLPKEPE KVGQVKMTIQ
KFYRINGSST QHKGVTPDVQ LPSAFSAEEY GESSQPSALP WDQINSTRYE QSRGIDDKIL
SRLRDRFDQR LKSDPELKQL AQDLADFKKA KENTVVSLQE AKRRKERDEA ERKRTAANKV
SQISASGDEA EPATTGTAAT PKKKKDLYLN EAGLVLADYI MATHLAVNK