Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Slin_3266 |
Symbol | |
ID | 8727019 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Spirosoma linguale DSM 74 |
Kingdom | Bacteria |
Replicon accession | NC_013730 |
Strand | - |
Start bp | 3945429 |
End bp | 3948380 |
Gene Length | 2952 bp |
Protein Length | 983 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | |
Product | hypothetical protein |
Protein accession | YP_003388076 |
Protein GI | 284038146 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.348309 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 0.462264 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGTCGTT CCGGTCTACA GCACCGGAAC GACCTATCTA TTTTCCTTAA CCAATCAGCT AACTATATGA AGATTTTTAC TGACAAATTA CCACTCAGGT CGGCACTCGT AACGGGACTT GCCTTGGGTA GCCTGTCTGT ACAGGCGCAA AAAATAGACG AGCTGTATAA CCAGAAGATC AGGGAATACA CGACAGATGC CCGGTTTCTG CCTGCTTCGG TGCTGAACCT GCCCGACGAC CCTAAAATCC CATCTCCGCT CAAACATTTC GGTCAGATTG TGGGCACGCC CGGCGTCATT CACCGCACGC CCGAGATTTA TGGCTATTAT CAGAAACTGG CTCAGACGTC ACCCAACATC AGTGTACAGC AGGTTAGTAC TACGGAGGAG GGGCGCCCGA TTCAGTTGGT GGTTATCGGC AGTGAAGACG CCATGAAGCG GCTCGATCAC TATAAAAAGC AACTCGCCCT GCTGGCCGAC CCCCGTAAAG TAGGGAGTCA GGATGTTGAA AAGATTCTGG GTGATACCAA GCTGGTGTAC TACCTCAACG GTGGCCTGCA CTCGCCCGAA ATGGGGTCAC CGGAGATGCT GATGGAGCTG GCTTACCGGT TGGTTACCAG CCAGTCGCCC GAGATAAAGA CGATTCGGGA TAACATCATT GTGCTAATCA ATCCGGTGTC GGAACCCGAC GGCTGGGATA AACAGGTCGA CTGGTACAAC CGCTATACCA AAGGCCGGAA AGAGTACGAT GACGGGTTTC CGAAATCGCC ACCGTACTGG GGAAAATACA CCTACCACGA TAACAACCGG GATGGTTTAC AGGCATCGCA GGAGTTGACG AAAGCGCTCT ATAAAATCTT TTACGAATGG CATCCCACTG CCAGTCTCGA CCTGCACGAG TCCGTCCCGT TGCTGTACAT ATCCACCGGA ACAGGGCCGT ATAACGAAAC CATCGACCCG ATCACCATTG GCGAATGGCA GATCATGGCG CACCACGACA TCACGACACT GGCCTCGCAG GGAGTTCCGG GCGTGTTTAC GTGGGCGTTT TACGATGGCT GGTATCCTGG TTATGCACTC TGGATTTCCA ATAACCACAA TGCCGTTGGC CGTTTTTATG AAACCTTCGG GAACGCCGGG GCGAACACCT ACCTGCGCGA TCTGGCCGAG CAGAAATACG CGGGCGACCC CGCCACTACG AAAGAATGGT ATCGGCCCGT GCCGCCCACC GAAAAAGTTT ACTGGTCGTA CCGGAATGGC ATCAATTACA TGCAGGCCGG GGTGCTGGCA TCGCTGTCGT ACGGCGCCAC GAATAGTCGG CTGTTGTTAA AAAACTTCTA TCAGAAAGGG CTGAACAACA TCAAAAAAGG GACGGAAGAA ACGCCACGTG CGTTCGTTAT TCCCAAAAAT CAGCGCGACC CGGCTATGGC GGCTTACCTG GTCAATCAAC TGCGTACGCA GGCCATTGAA GTTCATCAGG CGGAGTCGGG CAAGAACAAA GGCGATTATG TCGTGTTGCT GAACCAGCCC TACCGCAATC TGGCCGTTTC ACTGCTAACG AAGCAGAACT ACCCGAAAGA AGCGAAATTT CCCCCCTACG ACGATATTGC GTGGACGCTG GGTTACCTGT ATGGGGTGGA CGTAAAAGCC GAAGATAGCG TCAACTATGT ACCCAGTGAG CTTAAACTCC TGAGCGAGAA TGTTAATTAT GCGGGGACGA TGGAGGGAGA GGGAACAAAC TATGTTCTCA ACTACAAAGC CCAAACCAAT GTGCTGCCCG CCCTGCTTTG GCTGAAAGGG CAGAGCAAGC AGGCAAAAGC CGTTGTGCTC GATACCAAAG CTACGTTCGG CGGACTAAAA GACACGCTGT CGGCGGGTGC AGTGGTGTTC AAAGGACTCA CAGGCGATCA GGCCAAGAAA CTGGCAGCGC AGTTTGGGCT GGATTTACAG GCAACAAAAG CGGAGCCCAT GGGTGTTGGA TCGCCCTTGC GGCAGCACGA GGTTACTCTG CCTCGGGTGG CCATTTACCA CAGCTGGTAC AACACGCAGG ATGAAGGCTG GGCGCGGTAT ACGTTCGAGC AGCGCGGTAT TCCATATATG TCCATCAACA AAGACCACCT GAAAGCGGGC GATTTGCGGA AGAAGTTCGA TGTGATTCTC ATTCCCCGGA TGCGCGGCAC ATCGACCAAC TTTATCCATG AAATCGACAA ACGATTCGGT CCCCTGCCGT ACACCAGAAC GCCCGAGTTT CCATCGCATG GTTTCCCGGA CGCCAGCAGC GATATTACCG GTGGGCCCGG ATTCGACGGC GTCGATAAAC TGAAACAGTT CGTCGAACAG GGCGGTGTGC TTGTCACGCT TGACAATTCA TCGCTCATGG TTGCCGAAGC GGGCATCACC CGCGATCTGG ACGAAGTGGC GGCTCCTACG CTGTTTCATC CGGGCTCCAT CGTAACGGTG AAAAACCGGC GTCCCGATAG CCCCGTTATG TACGGCTTTC CCGAAATCTT TCCCATTTTT CGGGGAATTG CTCCGTTGCT GCAAACAAAA AAGCATAACC GCGACATGAT GCTGATGCAG TATGGCACCA AACCGCTCAA AGACGAAGAA GAATACAAGG GACTGATTAT GGGCATGCCC GATAAAAAAC CGGCTAAAGA AGCGAAGGCG ACACCCAAAA AAGAAGACCC GTATGTGGTG TCAGGGATGG TTCGCAATGA GCAGACGATC ATTGGGCATG GCGGGATTTT CAACGTGCCG GTGGGTAGCG GCCGGGTCAT TGCTTTCACC TTCGATCCAC TGCATCGGTA CCTCAACCAC CACGACGCCC CGCTACTCTG GAACGTGCTG ATCAACTGGA ATCATCTGGA TACACCGCCC GTATCGGCCA CAGCAGACAC CGAAACACCG AACCGGGCCA ATTCACCAGT CATAAAGACA GGAGATAATT AG
|
Protein sequence | MGRSGLQHRN DLSIFLNQSA NYMKIFTDKL PLRSALVTGL ALGSLSVQAQ KIDELYNQKI REYTTDARFL PASVLNLPDD PKIPSPLKHF GQIVGTPGVI HRTPEIYGYY QKLAQTSPNI SVQQVSTTEE GRPIQLVVIG SEDAMKRLDH YKKQLALLAD PRKVGSQDVE KILGDTKLVY YLNGGLHSPE MGSPEMLMEL AYRLVTSQSP EIKTIRDNII VLINPVSEPD GWDKQVDWYN RYTKGRKEYD DGFPKSPPYW GKYTYHDNNR DGLQASQELT KALYKIFYEW HPTASLDLHE SVPLLYISTG TGPYNETIDP ITIGEWQIMA HHDITTLASQ GVPGVFTWAF YDGWYPGYAL WISNNHNAVG RFYETFGNAG ANTYLRDLAE QKYAGDPATT KEWYRPVPPT EKVYWSYRNG INYMQAGVLA SLSYGATNSR LLLKNFYQKG LNNIKKGTEE TPRAFVIPKN QRDPAMAAYL VNQLRTQAIE VHQAESGKNK GDYVVLLNQP YRNLAVSLLT KQNYPKEAKF PPYDDIAWTL GYLYGVDVKA EDSVNYVPSE LKLLSENVNY AGTMEGEGTN YVLNYKAQTN VLPALLWLKG QSKQAKAVVL DTKATFGGLK DTLSAGAVVF KGLTGDQAKK LAAQFGLDLQ ATKAEPMGVG SPLRQHEVTL PRVAIYHSWY NTQDEGWARY TFEQRGIPYM SINKDHLKAG DLRKKFDVIL IPRMRGTSTN FIHEIDKRFG PLPYTRTPEF PSHGFPDASS DITGGPGFDG VDKLKQFVEQ GGVLVTLDNS SLMVAEAGIT RDLDEVAAPT LFHPGSIVTV KNRRPDSPVM YGFPEIFPIF RGIAPLLQTK KHNRDMMLMQ YGTKPLKDEE EYKGLIMGMP DKKPAKEAKA TPKKEDPYVV SGMVRNEQTI IGHGGIFNVP VGSGRVIAFT FDPLHRYLNH HDAPLLWNVL INWNHLDTPP VSATADTETP NRANSPVIKT GDN
|
| |