Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Slin_0314 |
Symbol | |
ID | 8724042 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Spirosoma linguale DSM 74 |
Kingdom | Bacteria |
Replicon accession | NC_013730 |
Strand | + |
Start bp | 403032 |
End bp | 405032 |
Gene Length | 2001 bp |
Protein Length | 666 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | |
Product | X-Pro dipeptidyl-peptidase domain protein |
Protein accession | YP_003385177 |
Protein GI | 284035247 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 0.820883 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGTTGGTAG CCTATGTGAT GCATATCGCT ATCTTTACAG ATATCAACTC CACTACTACC CATCAATACC TCATGAAACA CTGGCTTTAT GCGTTGCTTT CAATTGCTTT CATCCTTCCC GCACACGGTC AACAAACCAC GACCGGCAAC TTCGTTCGCG ATAACTACCA GAAATTCGAG TACAAGATTC CTATGCGCGA CGGTACCAAA CTGCATACGT CGGTCTACGT TCCTAAGGAT GCATCGGCTA CGACGAAGTA CCCATTTCTG ATGCAGCGGA CGTGTTATAG CGTTGCGCCT TACGGTGCCG ATGCTTACCC GGCACAGGTG GGGCCATCGG GAACGCTCAT GCACCAGAAA TTTATCTTTG TGTATCAGGA TGTACGGGGT CGCTGGGCGT CGGAAGGCAC CTGGACCAAC ATGACGCCCA ACCTGCCCGA CCCTCCGGCA TCTGCCAAAA AAGCGGGCAA GAAAGGGCAG ACAACCGTTT CAGCGGCAGG CTCGCCCGAT GAAAGTTCTG ATACCTACGA TACCATTGAG TGGCTGCTTA AAAACGTACC CAATAACAAC GGTCGGGTAG GGCAGTGGGG CATTAGTTAT CCGGGCTTTT ACACGGCTGC ATCTCTACCC GACGCGCACC CCGCCCTGAA AGCCGCATCA CCACAAGCCC CTGTCTCCGA CTTTTTCTTC GATGATTTTC ACCATAACGG GGCCTTCATT CAGGCTTACC TGTTTACGTT TCCCGTTTTT GGTGTGCAGC ATCCCGAGCC AACCACCAAA GCCTGGTATA ACGATCAGAT GATTCAGACG GGCACGAAAG ACGGTTTCCA GTGGCAGTAT GACCTGGGGC CGCTGAAAAA TGCCGATAAA TACTACAAGG ATAATTTTTA CTGGCAGGAA ACCGTCAACC ACCCCAATTA CGATGCGTTC TGGCAGAAAC GGAGTATCCT GCCACATCTG AAAAACGTTC GTCCGGCCGT TATGACGGTG GGCGGCTGGT TCGATGCCGA GGATTTATAC GGTCCGCTGA ACGTGTATAA AACGATTGAG AAGAGCAGTC CCGGTGCCTA TAATACGCTG GTGATGGGGC CGTTCGGACA CGGCCGGTGG TCGCGCGAAA CGGGCCATAC GCTGCATAGC AACGTCTATT TTGGCGATAG CATCGCTACG TTCTACCAGC GGAACATCGA ATCGAAGTTT TTCACGCATT TCCTGAAAGG AGCAGGCGAT GGAAAAAGTG GTTTGCCCGA GGCTTATCTC TTCAATACCG GCCGTAACGA GTGGAAAACG TTCGACAAAT GGCCCGTAGC CGACGCTTCC CAAAAACAGT TTTTCCTGAT GTCGGATGGT ACCCTGGCCA CTAACCGGAC GATGGAGGTT GGCAGCAATC GGTTCTCCGA ATTCATCAGC GACCCGGTTA AACCCGTGCC GTATACGGAA GATATTACCA CCACGCAGGG CTTTACGCCG TTCAATTATA TGTCGGAAGA CCAGCGGTTT GCGGGCCGAC GGCCGGATGT GCTCACCTTC CAGACGGAGG TGCTGACCGA AGATTTAACG CTGGGTGGCG AAATCATGGC GAAGCTCAAA GTGAGTACAA CCGGCACCGA CGCAGACTGG GTGGTCAAGC TCATTGACGT GTACCCGCCC GATGAACCTA ATCATCCGTA TATGCCTAAC CGGAACATTA CGCTGGGCAA TTACCAGCAG ATGGTGCGGT CGGAGGCTAT GCGCGGGCGG TTCCGCAATT CGTTCGAGAA GCCGGAGCCG TTTAAACCCG GCGAGGTGAC TGACGTGAAT TTCCGCCTTC AGGATGTGCT GCATACGTTC AAAAAAGGCC ACCGGGTCAT GATTCAGGTG CAGAGTACGT GGTTCCCGCT CATCGACCGG AATCCGCAGA AATACGTGGA GAATATATTT AAAGCGGATG TTGCCGATTT TCAGAAAGCA ACGCATCGCG TGTACGACAA TTCAGTAATT GAAGTGCAGG TGTTGAAATA A
|
Protein sequence | MLVAYVMHIA IFTDINSTTT HQYLMKHWLY ALLSIAFILP AHGQQTTTGN FVRDNYQKFE YKIPMRDGTK LHTSVYVPKD ASATTKYPFL MQRTCYSVAP YGADAYPAQV GPSGTLMHQK FIFVYQDVRG RWASEGTWTN MTPNLPDPPA SAKKAGKKGQ TTVSAAGSPD ESSDTYDTIE WLLKNVPNNN GRVGQWGISY PGFYTAASLP DAHPALKAAS PQAPVSDFFF DDFHHNGAFI QAYLFTFPVF GVQHPEPTTK AWYNDQMIQT GTKDGFQWQY DLGPLKNADK YYKDNFYWQE TVNHPNYDAF WQKRSILPHL KNVRPAVMTV GGWFDAEDLY GPLNVYKTIE KSSPGAYNTL VMGPFGHGRW SRETGHTLHS NVYFGDSIAT FYQRNIESKF FTHFLKGAGD GKSGLPEAYL FNTGRNEWKT FDKWPVADAS QKQFFLMSDG TLATNRTMEV GSNRFSEFIS DPVKPVPYTE DITTTQGFTP FNYMSEDQRF AGRRPDVLTF QTEVLTEDLT LGGEIMAKLK VSTTGTDADW VVKLIDVYPP DEPNHPYMPN RNITLGNYQQ MVRSEAMRGR FRNSFEKPEP FKPGEVTDVN FRLQDVLHTF KKGHRVMIQV QSTWFPLIDR NPQKYVENIF KADVADFQKA THRVYDNSVI EVQVLK
|
| |