Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Slin_4170 |
Symbol | |
ID | 8727929 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Spirosoma linguale DSM 74 |
Kingdom | Bacteria |
Replicon accession | NC_013730 |
Strand | + |
Start bp | 5018403 |
End bp | 5021312 |
Gene Length | 2910 bp |
Protein Length | 969 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | |
Product | peptidase M16 domain protein |
Protein accession | YP_003388956 |
Protein GI | 284039026 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.353568 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 0.732695 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCACAC GCTTTCCTTT ACTGCTACCC TTCCTCTGCC TCTTTTCTGC CGGAGTGACG TACGCCCAAA CCCGACTCGT TGAGAAAGTA ACCCGGCAGG GTAACGAACT CGTTATTCCC TACGAGAAAT ACGTTCTGCC CAATAGCTTA ACGCTAATTG TTCACGAAGA CCATTCCGAT CCCATCGTTC ATGTCGACGT CACGTACCAC GTAGGCTCGG CCCGTGAAGA AATCGGGAAG TCGGGTTTTG CCCACTTCTT CGAACACATG ATGTTTCAGG GTTCCGACCA CGTAGCCGAC GATGAGCATT TCAAAATCGT GACCGAATCG GGCGGAACGC TCAACGGATC GACCAACCGC GACCGCACGA ACTATTACGA AACGCTCCCC AGCAACCAGC TCGAACGGGC ACTCTGGCTC GAAGCCGACC GCATGGGCTT TCTGCTGGAT GCTGTTACGC AGAAGAAATT CGAGATTCAG CGGGCTACCG TTAAAAACGA ACGCGGCCAG AATTACGACA ACCGGCCTTA TGGACTGGCT GGTGAGTACG TAGCCAAAAA CCTGTATGCG TATGGTCATC CCTACTCATG GCTCACCATC GGCTACATCG AAGACCTGAA CCGAGTCAAT GTAAACGACC TGAAAAATTT CTTCCTGCGG TGGTACGGCC CCAACAATGC CGTGCTGACG ATTGGGGGCG ATGTAACCGC CAAGCAAGTA GTAGCCCTGA CGGAAAAGTA CTTCGGTTCC ATTCCGCGTG GTCCGGAAGT AACAAAAACG CAGGTGCCCA CGCCCGTAGT GGATAAGGAT CGGTACGTTT CGTACGAAGA CAACGTACGC TTCCCCATGC TGCAACTGGT GTTTCCGACG GTGCCGAACT ACCACCCGGA CGAAGCGCCC CTCGACGCGC TGGCCGAAAT TCTGGGCGGT GGTAAAAACT CGCTTTTTTA CAAGAATCTG GTGAAAACAC AACTGGCCGT GCAGGCGAAT GCCTCGCACC CCGCCACGGA ACTGGCCGGG CAGCTGGCCA TGGTCGTCCT GCCGTTTCCG GGCAAGAGTC TGGACAGTAC AGAAGCCGTC GTTCGCCGGA CACTGGCCGA ATTTGAACGA CGGGGTGCTA CCGACGATGA TATTGCGCAA TTCAGAGCCA CCCGCGAAGC CGACCTGATC AACGGCCTGT CGAGTGTATC CGGGAAAGTG TCGCAACTGG CGGCTTTCCA GACTTATCTC GGCAACCCGA ACTACCTGCC GCAGGAGCTG AAACGCTACA AGAGTGTGAC AAAGGCCGAT GTAATGCGGG TATACAACCA GTACATTAAA GGCAAAAACG CGGTAATCCT GACGGTATAT CCCAAAGGGA AACCCGAAAT TGTGGCGAAG CCCGATAACT ATACGGTGTC GACGGCTAAC TACAAGGCAC CAGACTACGG CTATAACGGA CTGGCCTACA CCAAGCCCAA AGACAGTTTC GACCGAGCGG TTAAACCCGG TCCCGGCCCG AACCCGGCGC TGAAAGTCCC TCCCTTCTGG AACGACAAAC TACCCAACGG CATCAAAGTG ATTGGCGCTC GCAACGACGA AATTCCGGCG ATAACGATGC TGTTCACCAT CAAAGGCGGG CATTTGCTGG AAGCCAATGA TCCATCGAAG GCCGGGGTGG CACAGTTGAC CGCTTCGCTG ATGAACGAAG CCACCCAGAA CTATACCAAC GAGCAACTCA ATACGAAACT CGAAAAGTTG GGCAGCAGTA TCGATATCCG GGCCAATACC GAAGAAATCA CCATCTCCGT CGAAGCCTTG ATAAAGAACC TGGATTCGAC ACTGGCTCTG GTTGAAGAGA AATTGCTGCG GCCCAAATTT GCTCAGGACG ATTTTGATCG ACTGAAGAAG CAACAGCTCG AACTGATCAG CAACCAGAGT ACACAACCGG TTGTGATTGC CAATAAAGCC TATAGCAAAC TGTTGTACGG CTCTGCCAAC ATCCGGTCGG TGCCCTTGAG CGGAACGACC AAAACCGTTG AAACCATCAC CCTGGACGAT GTGAAGGCTT TCTACAAAAA TTACCTCTCG CCATCAGTAA CGAATATGGT GGTTGTAGGC GATATTGAGC AGGCGGCTAT CATGCCTAAA CTGGCGTTTT TATCGAAGTG GGCAGCTAAA CCGGTAAAAA TACCGACGAC ACCGGCCCCT AAGAAAATCG ACAAAACCCG CCTTTATCTG ATCGACAAAG AACAGGCGGC TCAGTCCGAA ATTCGTATCG GTTACCTCAC CAATATGCCT TACGATGCTA CCGGTGATTA CTACAAGGCC GCATTAGCCA ACTACATGCT GGGTGGAGCC TTCAGCAGCC GCATCAACAT GAACCTGCGC GAAGACAAAG GCTATACCTA CGGGGCTCGC TCTGGTTTTT CGAGTACTAA CACCCCCGGC CCCTTTACGG CGCAGGCGGG CGTGAAAGCA GCGGCAACAG ACAGCTCAGT AATTGAATTT GTGAAGGAGA TTACCAACTA CGCCAAGTCG GGCATTACGG AGCAGGAACT GGCCTTTGTG AAAAGTTCGC TGGGACAAAG CGACGCCCTG CGTTACGAAA CATCGCTCCA GAAGGCGTTT TTCCTCAGCC GGATCATCGA GTACAACCTG CCCCGCAATT ATGTGGAGCA GCAGAGCGAA ATTCTCCGTA AGATCACCAA AGCCGAAATC GACGCGGTTG CCAAAAAGCA ACTACCTATC AATAACATGA TTATAACGGT AGTTGGTAAC AAGCAATTGA TTAAACCCGG CCTGGAAAAA CTGGGCTACG AACTGGTTGA GCTGGATAAA GAAGGCAACG TACTGGGTTC GTCAACCACC CCGGCCGACG TACCAGCGGG CGGTGCCCCG GCTAAAATGT CATCGGGCAA GAAGAATTAA
|
Protein sequence | MTTRFPLLLP FLCLFSAGVT YAQTRLVEKV TRQGNELVIP YEKYVLPNSL TLIVHEDHSD PIVHVDVTYH VGSAREEIGK SGFAHFFEHM MFQGSDHVAD DEHFKIVTES GGTLNGSTNR DRTNYYETLP SNQLERALWL EADRMGFLLD AVTQKKFEIQ RATVKNERGQ NYDNRPYGLA GEYVAKNLYA YGHPYSWLTI GYIEDLNRVN VNDLKNFFLR WYGPNNAVLT IGGDVTAKQV VALTEKYFGS IPRGPEVTKT QVPTPVVDKD RYVSYEDNVR FPMLQLVFPT VPNYHPDEAP LDALAEILGG GKNSLFYKNL VKTQLAVQAN ASHPATELAG QLAMVVLPFP GKSLDSTEAV VRRTLAEFER RGATDDDIAQ FRATREADLI NGLSSVSGKV SQLAAFQTYL GNPNYLPQEL KRYKSVTKAD VMRVYNQYIK GKNAVILTVY PKGKPEIVAK PDNYTVSTAN YKAPDYGYNG LAYTKPKDSF DRAVKPGPGP NPALKVPPFW NDKLPNGIKV IGARNDEIPA ITMLFTIKGG HLLEANDPSK AGVAQLTASL MNEATQNYTN EQLNTKLEKL GSSIDIRANT EEITISVEAL IKNLDSTLAL VEEKLLRPKF AQDDFDRLKK QQLELISNQS TQPVVIANKA YSKLLYGSAN IRSVPLSGTT KTVETITLDD VKAFYKNYLS PSVTNMVVVG DIEQAAIMPK LAFLSKWAAK PVKIPTTPAP KKIDKTRLYL IDKEQAAQSE IRIGYLTNMP YDATGDYYKA ALANYMLGGA FSSRINMNLR EDKGYTYGAR SGFSSTNTPG PFTAQAGVKA AATDSSVIEF VKEITNYAKS GITEQELAFV KSSLGQSDAL RYETSLQKAF FLSRIIEYNL PRNYVEQQSE ILRKITKAEI DAVAKKQLPI NNMIITVVGN KQLIKPGLEK LGYELVELDK EGNVLGSSTT PADVPAGGAP AKMSSGKKN
|
| |