Gene Information |
Locus tag | Slin_3940 |
Symbol | |
ID | 8727698 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Spirosoma linguale DSM 74 |
Kingdom | Bacteria |
Replicon accession | NC_013730 |
Strand | + |
Start bp | 4722278 |
End bp | 4723336 |
Gene Length | 1059 bp |
Protein Length | 352 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | |
Product | arsenical-resistance protein |
Protein accession | YP_003388729 |
Protein GI | 284038799 |
COG category | |
COG ID | |
TIGRFAM ID | |
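The coordinate and length fields above can be cross-checked against each other. The sketch below assumes the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length; the record does not state either convention, but both agree with the listed values. The function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 4722278
END_BP = 4723336
GENE_LENGTH_BP = 1059
PROTEIN_LENGTH_AA = 352


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive start and end coordinates."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))   # compare with "Gene Length"
    print(protein_length(GENE_LENGTH_BP))  # compare with "Protein Length"
```

Under these assumptions, 4723336 - 4722278 + 1 gives 1059 bp, and 1059 / 3 - 1 gives 352 aa, matching the Gene Length and Protein Length fields.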
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.29771 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 0.461155 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATACCG AAACGCTACC GGGTAAACCC GTCAACCGAC AGCTTTCTTT TCTGGACCGC TACCTCACGC TCTGGATATT CGGAGCGATG GCTCTGGGTA TTCTGCTGGG GAACCTGTTT CCTCAAATTG AGCAATCCCT GAATGCCTAC CAGCAGGGTA CAACCAATAT TCCGCTGGCC ATTGGCCTGA TCCTGATGAT GTATCCGCCC CTGGCCAAAG TCCGGTACGA CCAGTTGCCT CAGGTATTCC GAAACAGAAA AATCCTGCTT CTTTCGCTGG TACAAAACTG GCTTATTGGC CCGGTATTGA TGTTTGTACT GGCCGTTTTA TTATTACCCG ACAAACCCGA ATACATGACC GGTCTGATCA TGATCGGCAT AGCGCGTTGT ATCGCCATGG TCATTGTCTG GAATGATCTG GCCCTCGGCG ACCGCGAATA CGTAGCAGGA CTGGTGGCCT TTAACAGCGT CTTTCAGGTC CTTTTCTATT CGGTCTATGC CTACGTTTTT GTTACGGTAC TACCTCCCCT ATTCGGTTTA CCCGGCATGG CAGTCGACAT CAGTATCGGT CAGATTGCCG AGAGCGTGTT CATCTACCTG GGTGTTCCGT TCCTGGCAGG TATGTTGTCT CGTTGGTCAC TGACCCGGTG GAAAGGGCAG TACTGGTACG AAACCCGCTT CCTGCCAGCC ATCAGCCCTA TTACACTAGT GGCCTTGCTC TTTACCATCG TGGCCATGTT TAGCCTGAAG GGTCGTCTGG TGCTGGAATT ACCAGGCGAT GTATTACGAA TAGCCCTGCC GCTGGTTGTC TACTTCGGGC TGATGTTCTT TGCCGCTTTC TACCTGGCTA AACGCGCTGG AGCCGATTAT CCTAAAAGCA CGTCGCTCGC CTTTACAGCC GCTGGCAATA ATTTCGAACT GGGCATTGCC GTAGCTATTT CTGTGTTCGG CATCAACTCC GGCGTAGCGT TTGCCGCTGT TATCGGGCCG CTGATCGAAG TACCCGTTCT GATTCTGCTA GTCAGTTTTG CGCTTAAACA GCAACCTAAG TTTAGTTGA
|
Protein sequence | MNTETLPGKP VNRQLSFLDR YLTLWIFGAM ALGILLGNLF PQIEQSLNAY QQGTTNIPLA IGLILMMYPP LAKVRYDQLP QVFRNRKILL LSLVQNWLIG PVLMFVLAVL LLPDKPEYMT GLIMIGIARC IAMVIVWNDL ALGDREYVAG LVAFNSVFQV LFYSVYAYVF VTVLPPLFGL PGMAVDISIG QIAESVFIYL GVPFLAGMLS RWSLTRWKGQ YWYETRFLPA ISPITLVALL FTIVAMFSLK GRLVLELPGD VLRIALPLVV YFGLMFFAAF YLAKRAGADY PKSTSLAFTA AGNNFELGIA VAISVFGINS GVAFAAVIGP LIEVPVLILL VSFALKQQPK FS
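The sequences above are printed in space-separated blocks of ten characters, so the spaces need to be removed before any analysis. The sketch below shows one way to do that and to compute a GC percentage. How the database derives its reported GC content (rounding, handling of ambiguous bases) is not stated in the record, so this is an assumed, simple definition: the share of G and C among all characters of the cleaned sequence. The function names are illustrative only.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace (block separators, line breaks) and upper-case."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


if __name__ == "__main__":
    # First two blocks of the gene sequence, used only as a short demonstration.
    print(gc_percent("ATGAATACCG AAACGCTACC"))
```

Passing the full "Gene sequence" value to `gc_percent` gives a figure that can be compared with the GC content field.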