Gene Information |
Locus tag | Slin_4394 |
Symbol | |
ID | 8728154 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Spirosoma linguale DSM 74 |
Kingdom | Bacteria |
Replicon accession | NC_013730 |
Strand | - |
Start bp | 5328484 |
End bp | 5330211 |
Gene Length | 1728 bp |
Protein Length | 575 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | |
Product | carboxyl-terminal protease |
Protein accession | YP_003389174 |
Protein GI | 284039244 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATGGAG ATGGTAACGC TATGAAAACG CCGGGCATGC CTGATGGTCA GGACAATAAG AACCGGATTC AGAATGATAA AGCAACCGTG CGTATACCCA TGCTGCTGGG CATTGCACTC GCAGGGGGCA TGTTAATTGG GGCGACGTTC TTTGGTGGCA CCCAGAGCAT GAATAATATC GGACGGGGAT ACAGCAAATA CAAGGAGATT CTGCAACTCA TCGAGAACAA CTATGTCGAT ACTGTCAATA CCGACGATCT GGTCGATTAC TCGATTACCA AAATGCTGGA GAAGCTCGAT CCGCATACGG CCTACATGAA CCCGCAGGAT GCCGTGGCCG CCCGGTCGCA GCTGGAAGGT GGATTTGACG GCATTGGCGT TGAGTTCAAC ATTTACAAAG ACACCGTTTA TGTAGTAACG CCCCTGGCCG GTGGCCCCTC CGAAACGGCG GGTATCCAAA GTGGCGACAA GATCATTAAG GTCGATGATA AACCCCTGGC CGGTGGCAAA ATAGAAAACA GCGCCGTGTT TAAAGCCCTG CGCGGCAAAC GCGATACGAA TGTTAAATTG ACTATTCTGC GAAAAGGCGA CAAGCAACCC AAGGAGTTTA CGATTACACG GGGCCGTATT CCGACGTACT CGGTCGATGC TGCCTATATG ATTGATGCCA AAACCGGTTA CATCAAAATA AACCGGTTCT CGGAAACAAC GTATGACGAG TTCAAGACGG CGCTGGCGTC GCTCAAGGCA AAAGGGATGT CGCAGTTGAT GATGGATCTG CGGAATAACC CCGGTGGATA CATGGACCGG GCTACCAACA TTGCCGACGA ATTTATTTCG GGCAACAAAC TGCTCGTCTA TACCGATGGC AAAGACAACC GGTACGACCG TAAAACGATG GCCCATATTG CAGGGCAGTT TGAAGAAGGT GCCCTGGTTG TGCTTATCGA CGAAGGCAGT GCATCGGCTT CTGAAATTGT GTCGGGTGCA TTGCAGGATC ACGACCGTGC CTTGATTGCT GGTCGGCGGT CGTTCGGGAA GGGGCTGGTA CAAATGCCGG TAACCCTGTC TGACGGTTCC GAACTGCGTC TGACCATTTC GCGCTATTAT ACACCCAGCG GCCGTAGTAT TCAGAAACCG TACGTGCCGG GTCAGGAGGG CGATTATGAA AAAGACCTCG AACTGCGCTC AAAGCGGGGT GAGTATTACA TTGCCGATTC GATCAAAAAC GACCCCAAAC TGAAGTTTAA AACCGACGGC GGACGGGTTG TATACGGCGG TGGCGGTATC ACGCCGGACT ACTTCATTCC CCGGGATTCA ACCTGGCAGA CGGCGTATCT GGTGCAACTT TACGGCAAGA GTATTATCCG CGAGTTTGCA ATGGAATATG CCAATGACAA CCGAAAGAAA CTGGAAAAAA TGTCGTTTGA AGAGTTCGAT AAGGCGGTTA CTATCAACGA TGAGCAGATG AACCGTTTGG TGAAAGACGC TACGGCGGAG GGCATCAAGT TCAACGAGAA AGAGTACAAC CGCTCTAAGA ACTACATCCG TACGCAGATA AAAGCGCTGG TAGCCCGGTC TATTTTCCAG AAGAACAACA AGGGGGGGCA AAACAATGAA TTCTTCCGAA TCATTGCCCA GACGGACGAC ACTTATCAGA AGGCACTGAA ACTCTTTGAT CGGGCCAACA AGCTCGAACA TGGAGCGATG ACGTATAATC AGAAGTGA
|
Protein sequence | MDGDGNAMKT PGMPDGQDNK NRIQNDKATV RIPMLLGIAL AGGMLIGATF FGGTQSMNNI GRGYSKYKEI LQLIENNYVD TVNTDDLVDY SITKMLEKLD PHTAYMNPQD AVAARSQLEG GFDGIGVEFN IYKDTVYVVT PLAGGPSETA GIQSGDKIIK VDDKPLAGGK IENSAVFKAL RGKRDTNVKL TILRKGDKQP KEFTITRGRI PTYSVDAAYM IDAKTGYIKI NRFSETTYDE FKTALASLKA KGMSQLMMDL RNNPGGYMDR ATNIADEFIS GNKLLVYTDG KDNRYDRKTM AHIAGQFEEG ALVVLIDEGS ASASEIVSGA LQDHDRALIA GRRSFGKGLV QMPVTLSDGS ELRLTISRYY TPSGRSIQKP YVPGQEGDYE KDLELRSKRG EYYIADSIKN DPKLKFKTDG GRVVYGGGGI TPDYFIPRDS TWQTAYLVQL YGKSIIREFA MEYANDNRKK LEKMSFEEFD KAVTINDEQM NRLVKDATAE GIKFNEKEYN RSKNYIRTQI KALVARSIFQ KNNKGGQNNE FFRIIAQTDD TYQKALKLFD RANKLEHGAM TYNQK
|
| |