Gene Slin_5214 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_5214 
Symbol 
ID8728980 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp6366400 
End bp6367995 
Gene Length1596 bp 
Protein Length531 aa 
Translation table11 
GC content52% 
IMG OID 
Productpeptidase M28 
Protein accessionYP_003389985 
Protein GI284040055 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACATTTC AACAACTAAT GCTGTCGGCG GGGCTTATTT CCCTGCCTTT TGTGAGTAGT 
GCGCAGGAGA AATTTGCCAA TACCATTACC GCCAGCGACC TTGAAAAACA CCTCCGCGTG
CTGGCCGCCG ACGATATGGA GGGTCGCGAA ACCGGAACCC GCGGGCAACG CAAAGCCGCC
GAATACATTG CCACTCAATT TGCTGCCGAG GGAATGAAAC CCATCGTCAA AGCCGACGAT
GGCAAGCTGG TTTATCAGCA ACCCTACACG CTTTACAAGA AAAACTGGGG TGATTTTTAC
GTCAGTGCGG GTGGCAAACG GTTCGAGCCT TCCAAAGACT TTATGCCCAA TGGCCTGCTG
TACTTACCCA CCGAAACGAG CTACGAAACG GTATTTGTCG GTTACGGCAT TGGCGACGCC
AATTACGATG ACTATGCGGG ACGTGATGTA AAAGGCAAAG CCATTGTTGT TCTGGACGAT
GAGCCTAAGA CCGCCGATGG CAAAAAACTC GTCAGTGGAA ACACCGAAGC GTCCAAATGG
GGCGGGCCGA ACGGCTGGCG GGCTAAAAGT CTGTTGGCTA AAGAAAAAGG AGCAGCTCAA
TTGTTCATTG TTTCAGCCGA ATCAGCCGAA GCGTTTAAGC AGTTGCTTAC CCAGCGTAGT
GCCATGCAGG CCCGTTTCAA CCGGCTCAGC CTGAAGGCAG GGGCTGAAAA TATAGGCTCT
ATGGGTGTTT TTCTGGTCAC CGCCGATATG GCTGCCAGTT TACTGAATAC ATCCACCGTT
ACGTTAACGC AGACGATGAC ACAGTTGGCT CAATCGGCCA AACCGGTTGC GTCGTCTCTG
GCGGGCAACG TAGCCGTTAA AGCCGACCGC GTGGATGAAA AAACAGAATC GTCGAACGTG
CTGGGCTTCA TTGAAGGGAC CGACAAAAAA GACGAAGTGT TGGTTGTTTC GTCGCACTAC
GACCACATTG GCATCAGCGC CGACGGCCAG ATCAACAACG GAGCCAACGA CGACGGGTCG
GGAACAGTAT CCGTGCTGGA AATTGCGCAG GCGTTTGCCA AAGCCAAAGC CGCTGGAAAA
GGTCCCCGTC GGTCTATTCT GTTCCTCACC GTTTCGGGTG AAGAAAAAGG ATTGCTCGGC
TCGCAATACT ACGCCGATAT GAATCCGGTT ATTCCACTGG AGAAAACCGT AGCCGACCTG
AACATCGACA TGGTAGGCCG GGTCGATGAC CTGCACCTGG GCAAATCCGA TAACTATATT
TATGTGATTG GTTCAGACAA GCTATCCTCA GAACTGCATA AGATCAGCGA GGAAACCAAC
AAGAAGCACA TTAATATGGA GTTGGATTAT AAGTACAACG ACCCGCAGGA TTCGCAGCGC
ATTTACTACC GCTCGGATCA CTACAACTTC GCCAAGCACC AGATTCCCAT CATCTTCTAT
TTCAACGGGC TGCACCCGGA TTACCACAAG CCAACGGACG ACATCGAGAA AATCGACTTC
AAACTAGCCG AAAAATCCGC CCGACTCGTG TTCTACACCG CCTGGGAAAT CGCCAACCGC
GACCAGCGCC TGGTGGTGGA TAGTAATAAG CAGTAG
 
Protein sequence
MTFQQLMLSA GLISLPFVSS AQEKFANTIT ASDLEKHLRV LAADDMEGRE TGTRGQRKAA 
EYIATQFAAE GMKPIVKADD GKLVYQQPYT LYKKNWGDFY VSAGGKRFEP SKDFMPNGLL
YLPTETSYET VFVGYGIGDA NYDDYAGRDV KGKAIVVLDD EPKTADGKKL VSGNTEASKW
GGPNGWRAKS LLAKEKGAAQ LFIVSAESAE AFKQLLTQRS AMQARFNRLS LKAGAENIGS
MGVFLVTADM AASLLNTSTV TLTQTMTQLA QSAKPVASSL AGNVAVKADR VDEKTESSNV
LGFIEGTDKK DEVLVVSSHY DHIGISADGQ INNGANDDGS GTVSVLEIAQ AFAKAKAAGK
GPRRSILFLT VSGEEKGLLG SQYYADMNPV IPLEKTVADL NIDMVGRVDD LHLGKSDNYI
YVIGSDKLSS ELHKISEETN KKHINMELDY KYNDPQDSQR IYYRSDHYNF AKHQIPIIFY
FNGLHPDYHK PTDDIEKIDF KLAEKSARLV FYTAWEIANR DQRLVVDSNK Q