Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Slin_2290 |
Symbol | |
ID | 8726032 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Spirosoma linguale DSM 74 |
Kingdom | Bacteria |
Replicon accession | NC_013730 |
Strand | + |
Start bp | 2770386 |
End bp | 2771666 |
Gene Length | 1281 bp |
Protein Length | 426 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | |
Product | major facilitator superfamily MFS_1 |
Protein accession | YP_003387110 |
Protein GI | 284037180 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.485265 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.00935989 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTACATGA CAGTTATCGC AAAATCTCGC TGGTACAGGC TTCTCCCCAT TGCTTTTATC ACCTACAGCC TGGCTTATCT GGACCGGGCA AATTTCGGAT TCGGCGCAGC CAGCGGCATG GCTGCAGATT TACACATTAC CCCGGCTATG TCTTCGCTGC TGGGATCACT ATTTTTTCTG GGCTACTTTT TCTTTCAGGT ACCCGGAGCT GTTTATGCCG AGAAGAAAAG TGCTAAATCC TTAATCTTCT GGTCCCTGAT TTTGTGGGGC GGGCTGGCCA TGGCAACCGG CATAATCAGC AATGTCAATC TGCTGATTAT TATTCGTTTT ATGCTGGGCG TAGTTGAAAG TGCGGTTATG CCTTCAATGC TGCTTTTCTT AAGCCGATGG TTTACCAAAG CAGAACGCTC TCAGGCCAAC ACATTTTTGA TTCTGGGTAA TCCGGCCACA ATTCTCTGGA TGTCGGTACT ATCTGGCTAC CTGGTTCAAG CTGTTGGATG GCGCTGGATG TTTATTCTCG AAGGTCTGCC AGCTATTATC TGGGCCTTTT TCTGGTGGCG ACTGGTTGAT AATACACCCG AAGATGCCAC GTGGCTAACG GTTGATGAAA AGAAAGCGCT TGTCGATCAA CTGCAACGGG AGCAGCAGGG CATAAAGCCG GTAAAGAATT ATGCACAAGC TTTCAAATCC AGAATTGTCA TACTACTTAG TCTGCAATAT GCCTTGTGGA GTATTGGCGT TTACGGCTTT GTGATGTGGC TTCCTTCCAT TCTTAATGCG GCACCTAACA TGACAATCGT TAAAACAGGC TGGCTGTCCT CAATACCTTA TGTACTTGCC ATAATTGGTA TGTTAGGCGC ATCCTATTTT TCGGATAAAA CACTGAACAG AAAATCGTTT GTGTGGCCGT TTTTACTTTT GGGGGCTATT GCTTTCTATG GCTCTTACCT GGCTGGAGCC GATCATTTCT GGATTTCTTT CGTCCTCCTT ATTATTGCCG GGGGAGCTAT GTATGCCCCA TACGGCCCAT TTTTTGCCAT CATTCCCGAA CTACTACCCA GAAACGTAGC CGGAGGAGCA ATGGCCCTGA TCAATAGCTT TGGCGCTTTA GGCTCATTTA TCGGCGCCTA TATAGTAGGC TATCTAAATG GCTCCACCGG TGGTTTTGGT ACATCTTATC TATTTATGGC AGGTTCGCTT TTACTATCTG CTATTATAAC GCTGATTGCG CTGGATAAAC CAGCTACTAC ACAGGAAAAC AGTCCGGTCA CGGTCAGCTA A
|
Protein sequence | MYMTVIAKSR WYRLLPIAFI TYSLAYLDRA NFGFGAASGM AADLHITPAM SSLLGSLFFL GYFFFQVPGA VYAEKKSAKS LIFWSLILWG GLAMATGIIS NVNLLIIIRF MLGVVESAVM PSMLLFLSRW FTKAERSQAN TFLILGNPAT ILWMSVLSGY LVQAVGWRWM FILEGLPAII WAFFWWRLVD NTPEDATWLT VDEKKALVDQ LQREQQGIKP VKNYAQAFKS RIVILLSLQY ALWSIGVYGF VMWLPSILNA APNMTIVKTG WLSSIPYVLA IIGMLGASYF SDKTLNRKSF VWPFLLLGAI AFYGSYLAGA DHFWISFVLL IIAGGAMYAP YGPFFAIIPE LLPRNVAGGA MALINSFGAL GSFIGAYIVG YLNGSTGGFG TSYLFMAGSL LLSAIITLIA LDKPATTQEN SPVTVS
|
| |