Gene Slin_5185 details


Gene Information

Locus tag: Slin_5185
Symbol:
ID: 8728951
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Spirosoma linguale DSM 74
Kingdom: Bacteria
Replicon accession: NC_013730
Strand:
Start bp: 6329562
End bp: 6331304
Gene Length: 1743 bp
Protein Length: 580 aa
Translation table: 11
GC content: 52%
IMG OID:
Product: oligoendopeptidase, M3 family
Protein accession: YP_003389956
Protein GI: 284040026
COG category:
COG ID:
TIGRFAM ID:
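
The Start bp, End bp, Gene Length, and Protein Length values above are mutually consistent. The sketch below checks this. It assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon. The function names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 6329562
END_BP = 6331304
GENE_LENGTH_BP = 1743
PROTEIN_LENGTH_AA = 580


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Amino-acid count, assuming exactly one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))   # 1743
print(coding_codons(GENE_LENGTH_BP))   # 580
```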


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.554474
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 36
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACAGCAA ATCAAACCCT GAATCTTCCG GCCCGGCCAA CTCGTTCTTT TATTGGTGAA 
ACCATCGAAC TAACCAGTTG GGACGATGTA AAGCCTTTAT ATGAAGAACT GCTGGACCGA
GCCATCGACA ATGCCGACGA GCTTAAGCAA TGGCTCATCG ACCGGAGCGA GCTGGAATCG
TACCTATCCG AGAATTTTGC CTGGCGTTAC ATCCGCATGA CCTGCGACAC GGCCAATGAG
GAACTGGTTA ATCAGCTGAA TTTCTTCATT GCCGAAATTC AGCCGCCCAT GACCACCTAC
GGTAATGAGC TGGACCTGAA AGCGGTTAAC AGTCCTTTTC TGAGTCAGCT GACCGATGAT
GGGTACGACG TAATGGTACG GGGCATGAAA AAAGCCATCG AAATTTTTCG GGACGAAAAC
GTGCCGCTCC AAACCGAACT ACAGACCGAA GAACGCAAAT ACGGCGCTAT TGTTGGGGCC
ATGACGGTGC AGATCGATGG GCGCGAAATG ACCCTGCCCG AAGCCAGCGA CCGGCTCCAG
TCGACCGACC GGGCCGTGCG GGAAGAAGCT TGGCGCAAAA TCTGGGAGCG ACGTTTTCAG
GATCACGAAA CGTTGGATCA ACTGTTCGAC CGATTGCGCG ATTTACGGCA TCAGATAGCC
GTCAATGCAG GCTTTGCCAA CTTCCGCGAT TATTCGTTTG CTGCTCTGGG TCGCTTCGAT
TACACGCCGG AAGATTGTAT CAACTTCCAC GAGTCGGTTG CTGAAGCTGT CGTGCCTTTA
CTAAACGAAC TGGCCGAAGA ACGTAAGAAA AAGCTATCGG TCGATCCCCT GCGTCCCTGG
GATGCCAAGG TGGATGTGGA AGGTCGCTCA GCGCTAAAGC CGTTTGCTAC GGGAGCCGAA
CTGCTGGAGA AAACGATTAC CTGTTTCGAC CGGCTCGACA AGGAACTGGG CGACGATTTG
CGGATCATGC GGGCCATGGG CCACCTCGAT CTGGAGTCAC GGAAGGGCAA AGCACCCGGC
GGGTATAATT ATCCCCTCGA AGAAATTGGC GTACCGTTTA TTTTTATGAA CGCCACCTCC
AGCCTGCGCG ATCTGGTAAC GATGGTCCAC GAGGGAGGGC ACGCTGTTCA CTCCTTCCTG
ACCCGCGATT TATCGCTTAA AGCGTTCCGT AATCCCCCGA TGGAAGTTGC CGAACTGGCT
TCCATGAGTA TGGAACTGCT GTCGATGGAC CATTGGGACG TGTTTTTCGA GAACCCCGAG
GAACTGCGCC GGGCTAAACT TCAGCACCTG GAGTCGATTA TTGAAACGCT CCCTTGGGTA
GCTACCATCG ACAAATTTCA GCACTGGATT TATGAAAACC CCACGCATAC CGATGCCGAA
CGACGGGAAA ACTGGGTACG TATATACAAT CAGTTTGCCG ATACAGTTAC CAACTGGAAT
GGCTTTGCCT TTTATCAGGA ATACCTGTGG CAGCGCCAGC TTCATTTATA CGAAGTACCG
TTCTATTACA TTGAGTACGG AATTGCTCAG TTAGGGGCTA TCGGAGTTTG GCGAAATTAC
CGTCGCGATC CAAAAGCGGG TTTGGATGGA TACAAGCAGG CGCTGAGTCT GGGCTACAAA
GCACCCATTC GGGAGATTTA TGCAGCAGCG AACGTACCCT TCGACTTTTC GCAGGAGCAT
ATCCGCGAGC TAATGGGCTT TGTATGGGAG GAGATTGAAA AATTAACCGG AACTAGCCGG
TAA
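
The GC content reported in the record can be recomputed from the gene sequence. The sketch below shows one way to do it, assuming the sequence is pasted as whitespace-separated 10-base blocks as printed here. The helper name `gc_content` is hypothetical, and only the first row of the sequence is used as sample input.

```python
def gc_content(seq_text: str) -> float:
    """Fraction of G and C bases, ignoring spaces and line breaks."""
    seq = "".join(seq_text.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First row of the gene sequence from this record (60 bases).
first_row = "ATGACAGCAA ATCAAACCCT GAATCTTCCG GCCCGGCCAA CTCGTTCTTT TATTGGTGAA"
print(f"{gc_content(first_row):.1%}")
```

Passing the full 1743 bp sequence instead of the first row gives the value to compare against the GC content field.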
 
Protein sequence
MTANQTLNLP ARPTRSFIGE TIELTSWDDV KPLYEELLDR AIDNADELKQ WLIDRSELES 
YLSENFAWRY IRMTCDTANE ELVNQLNFFI AEIQPPMTTY GNELDLKAVN SPFLSQLTDD
GYDVMVRGMK KAIEIFRDEN VPLQTELQTE ERKYGAIVGA MTVQIDGREM TLPEASDRLQ
STDRAVREEA WRKIWERRFQ DHETLDQLFD RLRDLRHQIA VNAGFANFRD YSFAALGRFD
YTPEDCINFH ESVAEAVVPL LNELAEERKK KLSVDPLRPW DAKVDVEGRS ALKPFATGAE
LLEKTITCFD RLDKELGDDL RIMRAMGHLD LESRKGKAPG GYNYPLEEIG VPFIFMNATS
SLRDLVTMVH EGGHAVHSFL TRDLSLKAFR NPPMEVAELA SMSMELLSMD HWDVFFENPE
ELRRAKLQHL ESIIETLPWV ATIDKFQHWI YENPTHTDAE RRENWVRIYN QFADTVTNWN
GFAFYQEYLW QRQLHLYEVP FYYIEYGIAQ LGAIGVWRNY RRDPKAGLDG YKQALSLGYK
APIREIYAAA NVPFDFSQEH IRELMGFVWE EIEKLTGTSR
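
The protein sequence follows from the gene sequence by codon-wise translation. The sketch below is a minimal pure-Python version. It assumes the gene sequence is given 5' to 3' in the coding frame, and it relies on the fact that translation table 11 assigns the same amino acids to internal codons as the standard code (the two tables differ in permitted start codons, which does not matter here because the gene starts with ATG). The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon table in TCAG order; internal codon assignments are shared by
# the standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq_text: str) -> str:
    """Translate a DNA string, stopping at the first stop codon."""
    seq = "".join(seq_text.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence from this record.
first_row = "ATGACAGCAA ATCAAACCCT GAATCTTCCG GCCCGGCCAA CTCGTTCTTT TATTGGTGAA"
print(translate(first_row))  # MTANQTLNLPARPTRSFIGE
```

The output matches the first 20 residues of the protein sequence listed above.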