Gene Slin_4398 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_4398 
Symbol 
ID8728158 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp5333791 
End bp5335218 
Gene Length1428 bp 
Protein Length475 aa 
Translation table11 
GC content53% 
IMG OID 
ProductPeptidase M23 
Protein accessionYP_003389178 
Protein GI284039248 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones38 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGTTAA AATTTCCCCA CGCTCCTCAT GTTGTTCGCT GGTCAGTAAT CGGGCTGGTC 
GTCTTTGGCT GGGTGTGGTC AATGCCAGCG GCACAGGCGC AGCAGACAAT GCGTGACCGG
CAGGTACTGG AGAAAGAGAA AAAACAGAAT CTGGAGAAAA TGAACCAGAT TCGGACCATT
CTGAAACAGA CAGCTTCCGA AAAACAGGCC GGACTAGGCC AGTTAAAGGC TCTGGACCAG
CAAATTCAGA CGCAGTCGCA GCAGATTGGG TTGCTTAATA AAGATCTTCA GCTAACCGAA
TCGGAAATCG CCGAACTCCG GCGAGCTAGT ACAACCCTAT CCAGAGACCT TGATAAACTC
AGGGATGAAT ACGGCTCCAT GATTTACGCA GCCGATAAGC GCCGACAACA GGTCAATCCA
CTGGGTTTTT TATTTGCCGC CGATAACTTC AATCAATTGG TTGCCCGTTA TCGCTATTTG
CGGCAGTACT CCGATGCTCG CCAGAGTCAG GTACGGCAGA TGTCGAACGT TAGGACGATG
CTGGACGGTA AACAACGGGC CACTCAGCGA AAACGCCAGG AGCAAAAGAA TACGATTGGG
GCCAAAGTGC AGGAAACCAA AAGTCTTGAA ACCCTTAAGG TAGTTAAAAA CCAGGTTGTT
AAGGAACTGG GGCAGAAAGA GGCCGAACTG AGAACCGAAC TGGCCGAAAG CCGCCGAGCC
GTTGGACAAC TTGAAGCGTT GATCAAACGA CTAATTGTCC GGGAAGCCCG CGAACGGGCC
GAACGTGAAG CTCGCGAACG GGCTGAGCGT GACCGAATTG CCCGGCTTGA AGCCGCCCGC
AAAGCAGCGG AACGCAAACG AGCTGAAGAC GCCATTGCGG CTGCCGAAAA GGCCGGAGAA
AAACCCGCCC CGGCCGACGT AGCCAAAGTA GAACGACCCG CAGAACCGGA ACCAGCGCCG
AAAAAACCGG ATGAACGCCG GAATAATAAC CTCAATGACG AAGAAACGGC ATTGGCTTCG
TCCTTTACAT CATCGCGGGC GCACCTACCC TGGCCCGTAA CCAAAGGCTT CATTTCTGAC
CGTTTCGGCC GAAAACCACA CCCGGTCCTT AAGGGGATCT ATGTGGAGAA TCAGGGAGTT
GATATTCAGA CAAACGCGGG CGAGGGTGTC CGGTCGGTAT ACGATGGCAT TGTACAGGAT
GTGACCAGCA TGCCGGGTAT GAACAATGTA GTGGCCATTC AGCATGGCGA TTACTTCACG
GTTTACGCCA AACTTCGCAG CGTATCGGTT CGGGTTGGGC AACGGGTAAA AGCCCGCGAA
TCAATCGGTA CCGTAGCAAC TGATAAAAAC GGGGTATCCG AAATCCAGTT TCAAATCTGG
AAGGAGTTTA CCAAACTCAA CCCCGAGTCG TGGCTCACTC CCCGCTAA
 
Protein sequence
MQLKFPHAPH VVRWSVIGLV VFGWVWSMPA AQAQQTMRDR QVLEKEKKQN LEKMNQIRTI 
LKQTASEKQA GLGQLKALDQ QIQTQSQQIG LLNKDLQLTE SEIAELRRAS TTLSRDLDKL
RDEYGSMIYA ADKRRQQVNP LGFLFAADNF NQLVARYRYL RQYSDARQSQ VRQMSNVRTM
LDGKQRATQR KRQEQKNTIG AKVQETKSLE TLKVVKNQVV KELGQKEAEL RTELAESRRA
VGQLEALIKR LIVREARERA EREARERAER DRIARLEAAR KAAERKRAED AIAAAEKAGE
KPAPADVAKV ERPAEPEPAP KKPDERRNNN LNDEETALAS SFTSSRAHLP WPVTKGFISD
RFGRKPHPVL KGIYVENQGV DIQTNAGEGV RSVYDGIVQD VTSMPGMNNV VAIQHGDYFT
VYAKLRSVSV RVGQRVKARE SIGTVATDKN GVSEIQFQIW KEFTKLNPES WLTPR