Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Slin_5152 |
Symbol | |
ID | 8728918 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Spirosoma linguale DSM 74 |
Kingdom | Bacteria |
Replicon accession | NC_013730 |
Strand | + |
Start bp | 6290429 |
End bp | 6291949 |
Gene Length | 1521 bp |
Protein Length | 506 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | |
Product | protease Do |
Protein accession | YP_003389923 |
Protein GI | 284039993 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0000102653 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 0.175523 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAGCA ACTGGAAATT ATTAGCGTTG ATGGCACTCC TGTCGAGTGT AGTAACGCTG GCCGCTTACA ACCTGTTGGG ATTCAACAAC CGGGATGTAA TTCTTAACGA AGCGTCGCCC ATTCAGCAGA TTACGGGCCG TCTGGCATCT ATGCCCGGTG GACCGAGCGC TGCACCCGGC GACTTCTCTA CGGCTGCCGA AGCCGTAACA CCAATGGTTG TACACATTCG CACAACCATG ACCCGTACTG TACGTCAGCA ACAGGTTCCC GACATTTTCC GGGAGTTCTT TGGCGATGAA TTTGGTGGTG GTCAGCGGCA GCCCCGCCGT CAGCAGGGTC AGGCATCTGG CTCGGGCGTA ATCATCAGCA AAGACGGTTA TATCGTAACC AATAACCACG TGGTACAGGA TGCCGATGAG GTTGAGGTTA TCATGACCGA CAAACGCAGC TTTAAAGCGA AAGTAATCGG TACCGACCCA TTGACCGACC TGGCCGTTAT TAAAGTAGAA GCCAACAATC TGCCAGCTAT TACGCTGGGT GATTCCGACG CTCTGAAATT AGGCGAATGG GTTTTGGCCG TTGGTTACCC ACTTGATCTC GAATCGACCG TTACGGCCGG TATCGTGAGC GCAAAAGGTC GCCGGATTGG TATCCTCGAC CAGAACATTA GCAAAACGGA TGCGAAGCCT GATTCGCCGG TTGAAGCTTT CATCCAGACG GATGCTGCCA TCAACCCTGG TAACTCAGGT GGTGCGCTGG TTAACCTGCG TGGCGAATTA GTCGGTATTA ACTCGGCTAT TGCTTCGGCA ACGGGCTATT ACAGCGGTTA TGGCTTTGCG GTACCTGTAT CGCTCGTGAA GAAAGTATCT GCCGACCTGC TCAAATATGG TAACGTACAA CGCGGTTATA TCGGCATTCT GCCAATTGAA CTGAACAGCA CGGTAGCTAA AGAGAAAGGT GCGAAAATTG GTCGTGGCAT CTACGTCGAG AGCGTTGTTG AAAAAGGTGC AGCTGAAGCC GCTGGTCTGA AAAAGGGTGA CGTCATCGTG AAAATGGAAG GCCAGCCGCT TGATTCAGAT GCGCAAATGC GTGAAATCAT CGGTCGTCGT CGTCCGGGCG ATGTGGTTAA TGTAACGGTT AACCGGGATG GTACCGAGCG TGACTTTAAA GTCGAACTTC GTAACCGTAA TGGTGGCCGG GATGTGATCA AGAAATCGGA CATTACCGCA GCCAATACCT CATTAAGTAC GCTGGGTGCC AGCTTTGAAG AGCTATCAGC TCAGGAAGCA AAACAGCTTG GTGTTACCGG CGGGGTTCGG GTCAAAAAAA TTACTGATGG TAAGCTGGCC GAAACTGATA TTGAGGAAGG CTTCATTATC GTAAAGGCAA ACGGTAAGAA CGTCAAAACG ACGAAAGACC TGCAAGCCAT CATGTCGACC GTTAAAGAAG GCGAAGGCCT GATGCTGATC GGCATGTATC CCAACAGCTC ACGGATGTAC TACTACGCCG TTCCGGTGTA A
|
Protein sequence | MKSNWKLLAL MALLSSVVTL AAYNLLGFNN RDVILNEASP IQQITGRLAS MPGGPSAAPG DFSTAAEAVT PMVVHIRTTM TRTVRQQQVP DIFREFFGDE FGGGQRQPRR QQGQASGSGV IISKDGYIVT NNHVVQDADE VEVIMTDKRS FKAKVIGTDP LTDLAVIKVE ANNLPAITLG DSDALKLGEW VLAVGYPLDL ESTVTAGIVS AKGRRIGILD QNISKTDAKP DSPVEAFIQT DAAINPGNSG GALVNLRGEL VGINSAIASA TGYYSGYGFA VPVSLVKKVS ADLLKYGNVQ RGYIGILPIE LNSTVAKEKG AKIGRGIYVE SVVEKGAAEA AGLKKGDVIV KMEGQPLDSD AQMREIIGRR RPGDVVNVTV NRDGTERDFK VELRNRNGGR DVIKKSDITA ANTSLSTLGA SFEELSAQEA KQLGVTGGVR VKKITDGKLA ETDIEEGFII VKANGKNVKT TKDLQAIMST VKEGEGLMLI GMYPNSSRMY YYAVPV
|
| |