Gene Slin_1868 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_1868 
Symbol 
ID8725605 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp2257544 
End bp2259070 
Gene Length1527 bp 
Protein Length508 aa 
Translation table11 
GC content53% 
IMG OID 
Productpeptidase M20 
Protein accessionYP_003386712 
Protein GI284036782 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.00940892 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCGTTA AAACCAGTTT GCTTTTATTG ATTTTACTCC CGGTAAGTGT TTTCGCCCAA 
ACACCGCAGC AACGTGTTCG GCAATACCGA CAGGCGCAGG AAACCGCTCT GATGGATGAA
TACCGGGAGT TTCTGTCCAT CCCCAATGTG TCTGCCGACT CGGTGAACAT CCGTAAAAAC
GCAGCTTTTA TTCTTCAGAT GATGAAGAAG CGGGGCATTT CGGGAGTGCT TCTCGATGGC
CCGACACCGG GCTCAACGCC CGCCGTATTC GGGGAGGTTC GGGTGCCGGG GGCTAAAAAG
ACGCTTGTTT TTTATGCCCA TTACGACGGA CAGCCGGTTA ACCCAAAGCA GTGGGGAGAA
GGTCTTCAAC CGTTTGTGCC CGTCTTTATC ACGGCTCCCG TGGAGCAGGG GGGCAAGATT
ATAACGACCT ATAAGTCGGG CGATCCAATC GATCCGAACT GGCGCTTATC GGGCCGGGGC
AGCGCTGATG ACAAAGCCGG TGTCATGACC ATCCTGAACG CCTACGACGC GCTGGTAAAG
TCGAACATAC CCCTCACGAC TAACCTGAAA TTCTTTTTTG AAGGAGAGGA AGAAGTTGGC
TCGACGCACC TGGGTGAGAT CTTCGAGAAG CACCGCGACA AACTGGCCGG TGATCTCTGG
ATTATTGCCG ATGGACCACG GCATGTGTCG GGTAAGCCCG TTGTGCAGTT TGGCGTTCGG
GGCGATGTGA ACATGTACCT GACAGTTTAT GGTCCTAAAC GACCCCTGCA CAGTGGCAAT
TACGGCAACT GGGCACCCAA CCCGGCCATG CGGATGGTTA AGTTGCTGGC CAGTATGAAA
GACGATAATG ACCATGTTGT TATCAAAGGA TTCTACGACG ATGTGGTGCC GTTAACGGCC
AGTGAACGGA CTGCGCTGGC GAAGGTGCCC AACATGGAAG CGGCCTTAAA GAAGGAACTG
GGCATTGCGC AACCCGACGG AAATGGAACC CCGTTTGTGG AACTACTCAT GCGCCCGACC
TTGAATATCA ACGGAATGCA GAGCGCAAAC GTGGGAGCTA TGGCGGGTAA CATTATCCCA
ACCAAAGCCG AAGCTGTACT TGACTTACGG CTCGTGCGGG GCAATGAGGT AACCCGGCAG
ATCGGTCGGG TGGTCGAGCA TATCCGGGCG CAGGGATACC AGGTGCTGGA TCGCGAACCA
ACCGATGCCG AACGGCAGCA GTTCCCGAAG CTAATTAAAA TTACAACCGG TCACGGCTAC
AATGCCCAGC GAACGCCAAT GGATTTACCC GTTGCTCAGG GCGTTGTGGC GGCTGTTCAG
GCAGTTAGCC CCGAACCAAT CGTTTTGTCG CCTTCCTCGG GGGGGAGTTT GCCCCTGTAT
ATGTTCGAAA AAGTGCTGAA AGCCAACGTG GTATCGGTGC CAGTAGTTAA TTACGATAAC
AATCAACATG CCGAGAACGA GAATGTGAAG GTACAATATC TCTGGGAAGG CATCGAGATT
ATGGGGTCGA TCATGCTGAT TAAGTAA
 
Protein sequence
MSVKTSLLLL ILLPVSVFAQ TPQQRVRQYR QAQETALMDE YREFLSIPNV SADSVNIRKN 
AAFILQMMKK RGISGVLLDG PTPGSTPAVF GEVRVPGAKK TLVFYAHYDG QPVNPKQWGE
GLQPFVPVFI TAPVEQGGKI ITTYKSGDPI DPNWRLSGRG SADDKAGVMT ILNAYDALVK
SNIPLTTNLK FFFEGEEEVG STHLGEIFEK HRDKLAGDLW IIADGPRHVS GKPVVQFGVR
GDVNMYLTVY GPKRPLHSGN YGNWAPNPAM RMVKLLASMK DDNDHVVIKG FYDDVVPLTA
SERTALAKVP NMEAALKKEL GIAQPDGNGT PFVELLMRPT LNINGMQSAN VGAMAGNIIP
TKAEAVLDLR LVRGNEVTRQ IGRVVEHIRA QGYQVLDREP TDAERQQFPK LIKITTGHGY
NAQRTPMDLP VAQGVVAAVQ AVSPEPIVLS PSSGGSLPLY MFEKVLKANV VSVPVVNYDN
NQHAENENVK VQYLWEGIEI MGSIMLIK