Gene Slin_3043 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_3043 
Symbol 
ID8726795 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp3697606 
End bp3698805 
Gene Length1200 bp 
Protein Length399 aa 
Translation table11 
GC content53% 
IMG OID 
ProductPeptidoglycan-binding lysin domain protein 
Protein accessionYP_003387853 
Protein GI284037923 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.592375 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value0.198967 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAAAT TTCCTCTTTA TATCCATCTG AGTCTGCTGG CGGGTATGGC ATTCACCCAG 
GTTCTGGCGG CCCGCCCGCC CTTGCCAACC AGCAAAACAG TTGCCGATTC GATAGGTGTT
GAAAAGAAAG ATGGCAAACG GTTCATTCTG CATCGCGTTG ATGAAGGACA AACGTTGTTC
GGTATCGCCC GTCGTTATAA GTGTTCCGTT GCCGACATCC GGTCTACAAA TCCAGAACTG
AAAGATGGCG TCAAATACGG CCAGGTTGTT CGAATCCAGA TTCCGGACGG TGCCCTGACC
CGTAAGGAAA AAAAAGTTAT CGACAAAGCG ATAAAGAAAC AGGAAAAGGA GGAAAAAAAG
GACGCGAAAG TAGCCGCGAC CCCAAAGCCT AAAGAAATAA AACCGGCAAA ACAGGCAGAG
AAAACCGTTA TCAGAAATGA TGATCCGGCA AAAGCGGGTA TCCATGTTGT TGAAACCGGA
CAAACATTAT ACAGTCTGGC TGGTTGTTAT GGCGTCTCGC AGGCCGACAT TCGAAAATGG
AATAACCTTC CGGGCAACAA TGTGCTCATC GGACAGGCAC TTATCGTTTC GGAAAAGGCG
TACCAGGCGC GTATGCCTGC TACGCCAACC CCCGCAACAA CGTCTAAACC GGTTGATCCC
CCCCGGCGTA CTGAACCGGT TGCGTTGACG CATACCGCTG GTCGGCCGGC AGAGTCCCAT
AAATCCGAAT CCCGGCCAGC AGCACCACCG GCCGACACCA AACCGGAACA CCACGCGGAG
ACACCGTCGG CTCCTGCTGC CACTAAACCG GTCGATAGCA AGCCCGCTGA CTCAGAAGCG
AAAGCCACTG CCAAACCTGC CGACAAGCCA GTTGACGAAG TGGAACTGCC GCGGCCCGGC
AATGATGCAC CCATGCCAAC GCGGGGTCGG CGGATTTCCA GCAGCGGGGT CGCCGAGATG
ATTGAAGGAA ATGATGGCTC GGGCAAGTAC CTCGCCCTGC ACCGAACGGC TCCCATTGGC
ACGCTCGTAC AAGTCCGGAA TGAGTTCAAT AACCAGAGTA TCTGGGTAAA AGTCATTGGC
CGACTACCCG ACACGGGTGT CAATGACAAG ATCTTAATTA AATTGTCGGC GCAGGCCTTC
GCGAAACTTT CGCCCGTAGA TCGACGTTTT CGGGCTGAGG TCAGTTACAT TGTTCGATAG
 
Protein sequence
MKKFPLYIHL SLLAGMAFTQ VLAARPPLPT SKTVADSIGV EKKDGKRFIL HRVDEGQTLF 
GIARRYKCSV ADIRSTNPEL KDGVKYGQVV RIQIPDGALT RKEKKVIDKA IKKQEKEEKK
DAKVAATPKP KEIKPAKQAE KTVIRNDDPA KAGIHVVETG QTLYSLAGCY GVSQADIRKW
NNLPGNNVLI GQALIVSEKA YQARMPATPT PATTSKPVDP PRRTEPVALT HTAGRPAESH
KSESRPAAPP ADTKPEHHAE TPSAPAATKP VDSKPADSEA KATAKPADKP VDEVELPRPG
NDAPMPTRGR RISSSGVAEM IEGNDGSGKY LALHRTAPIG TLVQVRNEFN NQSIWVKVIG
RLPDTGVNDK ILIKLSAQAF AKLSPVDRRF RAEVSYIVR