Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Slin_1020 |
Symbol | |
ID | 8724750 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Spirosoma linguale DSM 74 |
Kingdom | Bacteria |
Replicon accession | NC_013730 |
Strand | - |
Start bp | 1234963 |
End bp | 1238076 |
Gene Length | 3114 bp |
Protein Length | 1037 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | |
Product | acriflavin resistance protein |
Protein accession | YP_003385870 |
Protein GI | 284035940 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.00000109455 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTCTCC CCGAGCTTAG TCTGAATCGA CCGGTTTTTG CCATGGTGAT GTCTATTGTC ATCGTCCTTT TTGGCATTAT CGGTTTCACC TTTCTGGGTG TTCGCGAATA CCCGGCTATC GACCCGCCGG TTATTTCGGT ACGGACAAAC TATACCGGTG CCAACCCAGA TATCATTGAA TCGCAGATTA CGGAACCCAT CGAGAAATCG TTGAACAGTA TCGAAGGGAT TCGTACAATC TCGTCGAACA GCGCGCTCGG TGCCAGTACC ATTACAGTCG AGTTCAACCT CGATGCCGAT TTGGAACAGG CCGCCAACGA TGTACGCGAT AAGGTAGCCC AGGCCCAACG GCAACTGCCG CAGGATATCG ACGCCCCACC TGTGGTAACA AAAGCCGATG CCAACTCTGA TCCTATTATT TTCATGACGG TTCAGAGTAC GACCCGCAAT CCAACCCAGT TGTCGGATTA CGCCGAAAAC GTGCTTCAGG AACGTCTGCA AACGATTCCG GGCGTTAGTC AGGCCAACAT CTATGGCTTG AAGCGTCAGG CTATGCGCCT TTGGATTGAC CCCATCAAAC TATCGGCCTA TCGGCTCACC TCACAGGATA TTCAGACCGC ACTGAATGCC CAGAACGTTG AGTTACCCAG CGGTAAAGTG TATGGTAATA CAACCGAGCT GACGGTGAAG GCGGTTGGTC GTCTGACAAC CGAAGATGAT TTCAATAACC TCATCCTTCG CCAGACGAGC AATCAAATTG TTCGTTTCAA AGATGTCGGG TATGCAACCA TCGGTGCGGA GAACGAAGAA ACTATCTCTA AACAGAATGG GGCAGTAGGG GTTATTCTGG CGCTTATTCC TCAGCCAGGT GCCAACTATG TGAGCATTGC CGATGAGTTT TATAAGCGCT TCGACCAACT CAAAAAAGAC CTGCCCGAGG ATATTATCGT AAGTATCGGC GTTGACCGGA GTACATTTAT CCGACGCGCC ATTGAAGAAG TAGGCGAAAC ACTGCTTATT TCGTTTGTAC TGGTCGTACT GGTTATCTAT TTCTTCTTCC GCGACTGGCT CATTGCTTTC CGACCGCTGA TCGACATTCC GGTATCGCTT ATCGGGGCCT TCTTCATCAT GTATGTGGCC GATTTCAGTA TTAACGTGCT GACCCTGCTC GGTATCGTTC TGGCAACCGG CCTTGTAGTA GATGATGGTA TTGTCGTAAC GGAGAATATC TTCAAGAAGA TTGAGCAGGG CATGGACACC AAGGAAGCTG CCCGCGAAGG TTCTAATGAG ATTTTCTTTG CCGTTATTGC AACCAGTGTT ACACTGGCTA TCGTGTTCCT GCCCATTATA TTCCTGGAGG GTTTTGTGGG TCGTCTGTTC CGCGAATTTG GTATCGTTGT CGCTGGTGCC GTATTAATCT CGGCCTTCGT TTCGCTAACC CTGACCCCGG TGCTTAGTGT AAAGCTCACC AGTAAGAACC ACGGTCGGTC CTGGTTTTAC CGAAAAACAG AGCCCTTTTT CGAATGGCTG GATAATTCTT ACCGGTCGTC ATTGAACAGC TTCATGAAAA AACGGGGCTG GGCGTTTGTT ATGATTGGTG CCTGTCTGCT GTTTATCTTC GGACTTGGCT CTATGCTCAA ATCGGAACTG GCCCCGCTCG AAGATCGTAG TCGGACCCGC CTGGTGATTA CTTCGCCTGA AGGAACAAGC TATGAGGCTC AGGCATCTCT AACGGACAGG GTCATGCAGT TTGTACTCGA CTCTATCCCC GAAACCAAAT TAGCCTTTAG CGTAGTAGCA CCCGGTTTTT CGGGGGCAGG CGCGGTTAAC TCCTCCTTCG TGATGGAGAA CCTGGTAGAC CCCAGCAACC GCAATCGGTC GCAGCAGGAT ATTGTCGATT ATATTAATAA AAATCTCAAG AAGTTCAACG AAGCCCGCAT GTTCGCTACG CAGGACCAGA CCATTCAGGT TGGCCGGGGC GGTGGATTGC CGGTGCAGTT TGTTATCCAG AACCTGAACT TCGAAAAACT CCGCGAGAAA CTGCCAACGT TTCTGGACGA AGTAGCCAAA GACCCAACGT TCCAGAACTC CGACGTAGAC CTGAAGTTTA ACAAACCGGA GCTGAACATT AGCATCGACC GCGAGAAAGC GACGAACCTG GGTATTTCGG TGCAGGATGT TGCCCAAACG CTCCAGCTTG CGCTTAGTAA CCGGCGTCTG GCTTACTTCC TGATGAACGG AAAGCAGTAT CAGGTAATTG GGCAGGTAGA CCGCGCCGAC CGTGATGCCC CCGTCGATCT GGCCTCTTTC TATGTACGTT CCAACCAGGG GCAACTTATT CAGTTAGACA ACCTGGTGAA ATTTCAGGAA GTGAGTAGCC CGCCCCAGGT ATACCACTAC AACCGCTTTA AATCGGCGAC GGTATCGGCG GGTCTGGCAC CCGGCAAAAC GGTGGGCGAC GGTGTAGAGG CCATGCGCGC TATTGCGGCT CGTACCCTCG ACGAAAGTTT CCAGACGGCC CTTTCAGGTC CTTCCCGCGA CTATGCCGAG AGTTCGTCCA ACACCTTATT TGCCTTTGGT CTGGCGTTAA TTCTGGTTTA TTTAGTTCTG GCGGCCCAGT TCGATTCGTT TATCGATCCG CTCATTATCA TGATCACCGT GCCTCTGGCG CTCGCGGGTG CCGTATTCTC ACTCTGGATG TTTAACCAAA CGCTGAATAT CTTCAGCCAG ATCGGGATTA TTATGCTGGT TGGTCTGGTT ACGAAAAACG GAATCCTGAT TGTTGAATTC GCCAATGAAC AGCGACTGAC GGGTAAGAAC AAGTTCGAAG CAGCAGCAGA ATCGGCTGCG TTGCGGCTTC GTCCTATTCT AATGACCACG CTTGTAGCGG CCTTTGGTGC TTTGCCACTG GCCCTTGCCC TGGGTTCGGC TTCAAAGAGC CGGGTACCGC TGGGTATCGT TATCGTGGGA GGACTGATGT TCTCGCTCAT TCTAACCCTG TACGTCGTTC CGGTCATTTA CACGTACATG TCCCGACGGA AAGATGTCCA GCCTGAAGTT GATTCAAAAT CGGAAGACAA GGAAAAACCA ACAAAGCTGG AAGTACATGC TTAA
|
Protein sequence | MSLPELSLNR PVFAMVMSIV IVLFGIIGFT FLGVREYPAI DPPVISVRTN YTGANPDIIE SQITEPIEKS LNSIEGIRTI SSNSALGAST ITVEFNLDAD LEQAANDVRD KVAQAQRQLP QDIDAPPVVT KADANSDPII FMTVQSTTRN PTQLSDYAEN VLQERLQTIP GVSQANIYGL KRQAMRLWID PIKLSAYRLT SQDIQTALNA QNVELPSGKV YGNTTELTVK AVGRLTTEDD FNNLILRQTS NQIVRFKDVG YATIGAENEE TISKQNGAVG VILALIPQPG ANYVSIADEF YKRFDQLKKD LPEDIIVSIG VDRSTFIRRA IEEVGETLLI SFVLVVLVIY FFFRDWLIAF RPLIDIPVSL IGAFFIMYVA DFSINVLTLL GIVLATGLVV DDGIVVTENI FKKIEQGMDT KEAAREGSNE IFFAVIATSV TLAIVFLPII FLEGFVGRLF REFGIVVAGA VLISAFVSLT LTPVLSVKLT SKNHGRSWFY RKTEPFFEWL DNSYRSSLNS FMKKRGWAFV MIGACLLFIF GLGSMLKSEL APLEDRSRTR LVITSPEGTS YEAQASLTDR VMQFVLDSIP ETKLAFSVVA PGFSGAGAVN SSFVMENLVD PSNRNRSQQD IVDYINKNLK KFNEARMFAT QDQTIQVGRG GGLPVQFVIQ NLNFEKLREK LPTFLDEVAK DPTFQNSDVD LKFNKPELNI SIDREKATNL GISVQDVAQT LQLALSNRRL AYFLMNGKQY QVIGQVDRAD RDAPVDLASF YVRSNQGQLI QLDNLVKFQE VSSPPQVYHY NRFKSATVSA GLAPGKTVGD GVEAMRAIAA RTLDESFQTA LSGPSRDYAE SSSNTLFAFG LALILVYLVL AAQFDSFIDP LIIMITVPLA LAGAVFSLWM FNQTLNIFSQ IGIIMLVGLV TKNGILIVEF ANEQRLTGKN KFEAAAESAA LRLRPILMTT LVAAFGALPL ALALGSASKS RVPLGIVIVG GLMFSLILTL YVVPVIYTYM SRRKDVQPEV DSKSEDKEKP TKLEVHA
|
| |