Gene Slin_1950 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_1950 
Symbol 
ID8725687 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp2361075 
End bp2362193 
Gene Length1119 bp 
Protein Length372 aa 
Translation table11 
GC content52% 
IMG OID 
Productpeptidase S15 
Protein accessionYP_003386794 
Protein GI284036864 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value0.286017 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAAGA TCACATCCTT ACTGGCCGCA GGTCTTTTGC TGTTAGGACA GGCTCAAGCA 
CAATCCGGAA AACCGGTATC CAATCAGAAA ACAGAGCGCA TGAATAAGAC AGGACAACCG
AAACACTACA CGTTTCAACT CAGCGATAAG GTGACGCGTC AGGCGGTAAC CTTCAAAAAT
CGGTATGGCA TCACCCTGGC TGCCGATTTA TACCTGCCCA AAGGTCAGGC GAATACACCT
TCGGCGGCAC TTGCCATCAG CGGTCCGTTT GGTGCCGTGA AGGAACAATC GTCCGGCTTG
TATGCGCAAA CCATGGCCGA GCGGGGGTTT ATCGGCCTGG CTTTCGACCC GTCGTATACC
GGTGAAAGTA GTGGAGAACC TCGCCATGTC GCTTCACCTG ATATTAATAC GGAAGATTTC
AGTGCCGCAG TCGACTTTTT GGGTTTGCAG CCTTCGGTCG ACAGAAACCG CATTGGCATC
ATTGGTATTT GTGGGTTTGC AGGCATGGCA CTGAATGCGG CCGCCGTGGA TAAACGCGTG
AAAGCCGTGG CTACCACCAG CATGTATGAT ATGTCGCGGG TAATGGCAAA AGGTTATTTC
GACAACCTGA CGGCCGACCA ACGCACGAAC ATGCTGGAAC AAATGAGCCA GCAACGCTGG
GCCGATGCCG GGAAAGGTGC TCCAGCGCCA TCCGCCAATA ACCTCCCCGA AAAGCTACAG
GGCAACGAAC CTCAGTTTGT CGTCGATTAT TACAATTATT ATAAAACGCC ACGCGGTTTT
CATCCTAATT CAATCAATTC GAATGGGGCC TGGACGGCCA CCAATCCGCT GTCGTTCATG
AATATGCCCC TGCTGACCTA CATCAATGAA ATTGCCCCAC GGCCGGTTTT GTTGATTGCC
GGAGAGAAAG CACACTCCCG CTATTTCAGC GAGGATGCCT ACAAAGCGGT TGCCGGGCCT
AAGGAGTTAC TCATTATACC GGGTGCCAGT CACGTTGATC TGTATGACAG GCTGGATATC
ATTCCGTTTG ACAAGCTAAC CGCTTTCTTC ACAGAATCCC TGAAGCCAGG TACGCCCCAG
GAAAAGAGCG TTGGCGCACG CTCAATAGAA AACAAATAG
 
Protein sequence
MKKITSLLAA GLLLLGQAQA QSGKPVSNQK TERMNKTGQP KHYTFQLSDK VTRQAVTFKN 
RYGITLAADL YLPKGQANTP SAALAISGPF GAVKEQSSGL YAQTMAERGF IGLAFDPSYT
GESSGEPRHV ASPDINTEDF SAAVDFLGLQ PSVDRNRIGI IGICGFAGMA LNAAAVDKRV
KAVATTSMYD MSRVMAKGYF DNLTADQRTN MLEQMSQQRW ADAGKGAPAP SANNLPEKLQ
GNEPQFVVDY YNYYKTPRGF HPNSINSNGA WTATNPLSFM NMPLLTYINE IAPRPVLLIA
GEKAHSRYFS EDAYKAVAGP KELLIIPGAS HVDLYDRLDI IPFDKLTAFF TESLKPGTPQ
EKSVGARSIE NK