Gene Slin_5951 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_5951 
Symbol 
ID8729732 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp7214178 
End bp7215458 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content49% 
IMG OID 
Productpeptidase M16 domain protein 
Protein accessionYP_003390712 
Protein GI284040782 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones44 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGCTCG ATAGAACACA ATCGCCCGGA TTTCAGGCTA TACAGGAAAT ACGCCTGCCA 
GCAGTACAGT CTCACCAACT GGATAACGGA ATTCCGCTGC ACCTGATTTC GGTTGCTCAG
CAGCCTGTTT TGCGGCTGGA GTGCGTATTT AATGCGGGAA CCTGGTATGA ACAGGTGCCA
GGTAGCGCAT TTTTTGCCAT GAAGATGCTG GCGGAAGGTA CACCCACACG TACATCTGCC
CAAATTAGCG AGTACATCGA CCGATACGGC GCTTTCCTGG AACTTAACAG CGGCCCCGAC
CGTGCCAGTA TTGTCATTTA CTGCCTCAGC AGGTTTTTGC CAAATGTGCT GCCTGTGCTT
CGGGAGATGC TTACTGAAGC TACCTTCCCG CAAAAAGAAC TGGACGACCT GCGGAACATC
ACCCTCCAGA ACCTGCGCGT CAATTACGAG AAGAATGCTT ATCTCGCCGG GGTTCTGTTC
CGGGAAAAAT TGTTTGGTAT CAACCACCCA TACGGGCGTA GTCAACGTCC CGAAAATGTC
GAAAAGCTTA CCCGGCAGGA TGTAGTTGAC TTCTTTAGTC AGGTTATCAG TAACCGGCCT
TTTCAGATAA TTCTGGCCGG GCAGGCCGCT GAAGATGAAC TGGCCGCGAT TAACCGTGAA
CTAGGGCAGT TAACTCTTCG TACAGACGCA CTCGCGGCAT TTGACGGAAG CGCCTATTCC
GACGACCGGT TGCCCATACT GGCTGATAAA CCGGACAGCG TTCAATCGTC AATCCGCGTT
GGTCGCCGGT TGTTTACCCG GTCACATCCT GATTTCTTTA AAATGCTTGT TACCAATGAA
ATCTTGGGCG GGTACTTTGG CTCCCGGCTC ATGAAGAATA TTCGTGAAGA GAAAGGATTT
ACGTACGGAA TCTCATCGAA TATGCCTTCG TTCCGGCAGG ATGGGTATTT CCTGATCGGA
ACGGATGTTA ACAAAGAAAA TACCCAGCAA ACGCTGGATG AGATCAGAAA GGAGATAAGT
ATCCTGCAAA CCGAGCCGGT ATCAGCGGAT GAACTGGAAA CAGTACAGAA TTATATGGCA
GGCGAATTTG TTGGATCATT GAATACACCC TTCGAAATTG CTGACCGGTA TAAAGTGGTT
TTACTGGATG GAATGCCCAC AGATTTCCTG ACAACGTATA TTCAAAAAAT TCGTCAGGTA
ACCCCAGCCG ATGTAATGGA GACAGCTAGC CGCTATCTGG CCCCCGAGGA TTTACGGGAA
GTAGTCGTAG GTGGTAAATA G
 
Protein sequence
MTLDRTQSPG FQAIQEIRLP AVQSHQLDNG IPLHLISVAQ QPVLRLECVF NAGTWYEQVP 
GSAFFAMKML AEGTPTRTSA QISEYIDRYG AFLELNSGPD RASIVIYCLS RFLPNVLPVL
REMLTEATFP QKELDDLRNI TLQNLRVNYE KNAYLAGVLF REKLFGINHP YGRSQRPENV
EKLTRQDVVD FFSQVISNRP FQIILAGQAA EDELAAINRE LGQLTLRTDA LAAFDGSAYS
DDRLPILADK PDSVQSSIRV GRRLFTRSHP DFFKMLVTNE ILGGYFGSRL MKNIREEKGF
TYGISSNMPS FRQDGYFLIG TDVNKENTQQ TLDEIRKEIS ILQTEPVSAD ELETVQNYMA
GEFVGSLNTP FEIADRYKVV LLDGMPTDFL TTYIQKIRQV TPADVMETAS RYLAPEDLRE
VVVGGK