Gene Slin_5737 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_5737 
Symbol 
ID8729512 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp6979444 
End bp6981579 
Gene Length2136 bp 
Protein Length711 aa 
Translation table11 
GC content53% 
IMG OID 
ProductPeptidase S46 
Protein accessionYP_003390501 
Protein GI284040571 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value0.654767 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGTTTA GAAAATTTCG TCTCGCTTTG CTGGCAACCT CTTGCCTGGC ATTCCTGGCT 
GGCCCGGCGT CAGCGCAAAC GACCACCGAT ACAACCAAAG GTGGCCCGCT CGATCTGGGA
AAAATGTGGA CCTTCGATAG TCCACCGTCC GCCTATTTTA AGAAAACCTA CAACTTTACC
GCCGATGAAA AGTGGTTCGA CGAAGCGCGG TTGGCTTCCC TGCGATTTGC TGATTACTGC
TCAGCGTCGT TTGTGTCGGC CAATGGGTTG GTGATGACCA ACCACCACTG CGCCCGTGAG
TCGGGTACGG GCGTAACCCG AAAAGGCGAA GACCTGAATG CAACCGGATT CTTTGCCAAA
ACACCCGCCG AAGAGCGGAA GGTGGATGGC CTGTTCGTTG ATCAACTGGT GAAGCTTCAG
GACATCACCA AACAGGTGCA GGACGCCATG AGTGGCGCAA CTTCCGAACA GGCGCAGTTG
CAGGCCCGTG AGCAGGCGTT TTCGGCCATT AAGCAGGAGT ATGGTACAAA AGAAGGCTGG
AAAGGACTCG AACTTCAAAC CATCACGTTT TATAACGGAG GCCGTTATGC GTTGTATGGC
TTCAAACGAT ATACCGATGT GCGGCTGGTG TTTATGCCTG AGTTACAGCT TGGCTTCTTC
GGGGGCGATT ACGACAATTT TACCTACCCG CGTTATGCCC TCGACTGCTC GTTTTTTCGA
GTATATGACG GCGGCAAACC GTTGAAAACA ACTCATTTTT TTAAGTTCAA TATGAACGGT
GTTCGCGATG GCGAACCCAT TTTTGTGATT GGAAATCCCG GCCATACCGA GCGGCTCAAG
ACCGTTGCCG AACTTGAATT TGACCGGGAT TTGCAGACGC CCGCTACTAT TCAGATGCTC
CGTAACCGTT CGGCGGCTTT ACAGGCTTAT AATGCTACGG CGAAGAATGA CAGCGTGCTG
AACGAGATTT TCAGCTATGA AAATAGTCTG AAAGCTTACG GCGGTCAGCT CGAAGGGCTG
CGCGATGCCA GTTTGCTGAA CCGTAAGGTG GTATTTGAGA ATCAGTTCAA AGCAGCAGCC
AAAGCTAAAA ACCTGCCCGC CGATCAGCTG AAAACGTGGG ATGAACTGGC TGCCAATACC
GCACAGCTAC GGAGTCTGTT CAAAGACGCG AACTACCTCG GACCCAGCGA GCGCACTATG
GGCGAGCTGC TGACCTTCGC CAATGTTGTT ACCCAGTTTA GCGAGTTACT GGCTACCCGC
CCGCAGGATG CCGAACGGGC TCGCTCGCTG ATGGTTACGC CAGAGGTGAA AAGTATGGCG
CTGGAAGAAG CGTATCTGGC AGCTCACCTG ACCGAAGCAC AGCTTGGTTT AGGAAATGAC
GATCCGTATG TGAAAGCTGC CTTGACCGGT ACCAACGGTA AGCGGCTAAC TCCCAAAGAG
GCCGCAGCGT ACCTTGTGAA AAATACAAAG TTGACCGATC CGGCCTTCGT GAGTGAGCTA
TCGACCCGAC CCAATGCCGC AGCGGCCTCC AACGATCCTA TGCTGGCGCT GGCGCGAATT
GGGTTTCCCC GCTACCTGGC AGCCGCCCGT CAGGCCCGTC AGATTTCGCA GAAGCAGGAG
GTGCTTCGTG GACAACTGGG CCGGATGTTG TATGATGTAT ACGGCACCGC TGTTCCACCC
GACGCTACCT TCTCGCTGCG GATCAACGAC GGTGTGGTGC AGTCCTATGA TTACAATGGT
ACGAAAGCGC CTATTCTGAC GACCTTCGCG GGCTTGTATG ATCGTAATTA CTCCTTCGCC
GATAAAGCGC CCTGGAATTT GCCTGCCCGT TGGAAGAGTC CGCCCATGGA ATTACTGAAA
CAACCCATGT GCTTCATCTC GACTAATGAT ATCATTGGCG GTAATTCGGG TAGTCCGATG
ATCAATAAAA ACCTCGAGGC CGTTGGGCTA GCCTTCGACG GGAACATGGA GAGTCTGCCC
GGCGAGTTCA TCTTCGTCCC CGACCGTAAC CGAACCATTT CGGTGCATAC AGGCGGCATC
ATTGCGGCCA TGCGGTATAT TTATAAAGCA GATCGGCTGG TTAGCGAGCT GACGGGTACG
CCGGTAACAG CGAAGCCCAA ACCAGTAAAA AAATAG
 
Protein sequence
MQFRKFRLAL LATSCLAFLA GPASAQTTTD TTKGGPLDLG KMWTFDSPPS AYFKKTYNFT 
ADEKWFDEAR LASLRFADYC SASFVSANGL VMTNHHCARE SGTGVTRKGE DLNATGFFAK
TPAEERKVDG LFVDQLVKLQ DITKQVQDAM SGATSEQAQL QAREQAFSAI KQEYGTKEGW
KGLELQTITF YNGGRYALYG FKRYTDVRLV FMPELQLGFF GGDYDNFTYP RYALDCSFFR
VYDGGKPLKT THFFKFNMNG VRDGEPIFVI GNPGHTERLK TVAELEFDRD LQTPATIQML
RNRSAALQAY NATAKNDSVL NEIFSYENSL KAYGGQLEGL RDASLLNRKV VFENQFKAAA
KAKNLPADQL KTWDELAANT AQLRSLFKDA NYLGPSERTM GELLTFANVV TQFSELLATR
PQDAERARSL MVTPEVKSMA LEEAYLAAHL TEAQLGLGND DPYVKAALTG TNGKRLTPKE
AAAYLVKNTK LTDPAFVSEL STRPNAAAAS NDPMLALARI GFPRYLAAAR QARQISQKQE
VLRGQLGRML YDVYGTAVPP DATFSLRIND GVVQSYDYNG TKAPILTTFA GLYDRNYSFA
DKAPWNLPAR WKSPPMELLK QPMCFISTND IIGGNSGSPM INKNLEAVGL AFDGNMESLP
GEFIFVPDRN RTISVHTGGI IAAMRYIYKA DRLVSELTGT PVTAKPKPVK K