Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RoseRS_3158 |
Symbol | |
ID | 5210128 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus sp. RS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009523 |
Strand | + |
Start bp | 3978064 |
End bp | 3979314 |
Gene Length | 1251 bp |
Protein Length | 416 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 640596749 |
Product | transposase, IS4 family protein |
Protein accession | YP_001277469 |
Protein GI | 148657264 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCCGCA CGTCCACAGA ACGGTTGCTG CAGCAGCAGA TCCAGACCCT CTTCCCGCGC TTGTCGCGTC ACCGGCGCCG CGCCCTGGCC CGCTGGGTCT TGGGCGCCTT ATTGGCGGGG AGCGCCAATC GTCCCGCGCT GGTGCACGCG CTCGCCACTG CCGGGATCGC CCGCGCCGCC ACCCTGGCGG ACGCCTGGGA TGCCTGGATC GCCGCCCCGG CCCATCGCAT TGACACCGCC GACCCACCCG CAGCGGGTGC GCCACCGGTG GTCAGTCCGC TGGCGTGCGG CGCCGACCTG CTCCGCTGGA TTCGCGCGCA CTGGACCGGC GGACCGCTGG TGCTTGGGCT GGATGCCTCC CATCGGCGCG ATGACGTCGT TCTGCTGCGC ATGAGCGTCC TCTATCGGGG CACCGCCCTG CCGGTCGCCT GGGTGATCGT CCCGGCGAAC CAACCGGGTG CGTGGGAACC GCACTGGGAG CGGATGCTGC GCTGGGCCCG CAGCGCGCTG CCGCTCGACC AGGAGGTCCT CGTGCTGGCG GATCAGGGGT TGTGGAGCCC CCGGCTGTGG CACGCCATCC GGTCGCAGCA GTTCCATCCC ATCATGCGGG TGCGCACCAC GTCGACCTTC GCGCCGACCG GTCAGGCGCG CCAGTCGGTG CTGCGCCTGG CGCCCGGACC GGGGCATGGA TGGGTGGGCG TGGGGGTCGC CTTCAAGCAC GCACCCAAGC GGATTGCGGG CACGCTGGCG GTGGCGTGGG GCGCCGACCA TGCGGAACCG TGGGTGCTGC TGACCGATCT GCCGCCCGCG CAGGTGGATG CCGCGTGGTA TGCCCTGCGC AGTTGGGATG AGGCGGGCTT CCGCCAAAGT AAGTCGATGG GCTGGGACTG GCAACGCGGT CAGGTGACGG ACCCGGATGC AGTCGCCTGG CAGTATCTGG TAGTGGCGAC GGTCACGCTG TGGACGGTGG CCGTCGGCAC GCGGATCGAA GATGCAGAAC AGCAGGGGGT TCCGCCCGGT CGCCTGAAGC GGGCGCCGCC GACGACCGGC GCGCCGCCGC GCCGTCGCTG GAGCGGCACG GCGCAGCGGG TGATCAGCCT GCTCCGGCGG GGGATGCAGC ACCTGCGCTG GTTGCTGGCG CAAGGGCGTT GTTGGGTCCG CTTGTGGTTG CGCCCGGAGC CCTTGCCCAA AATAGGTGAC AGCGTAACCA TGCATATCTA TGACCCGTCC CAATGCCTGA AATCGCCCTA A
|
Protein sequence | MSRTSTERLL QQQIQTLFPR LSRHRRRALA RWVLGALLAG SANRPALVHA LATAGIARAA TLADAWDAWI AAPAHRIDTA DPPAAGAPPV VSPLACGADL LRWIRAHWTG GPLVLGLDAS HRRDDVVLLR MSVLYRGTAL PVAWVIVPAN QPGAWEPHWE RMLRWARSAL PLDQEVLVLA DQGLWSPRLW HAIRSQQFHP IMRVRTTSTF APTGQARQSV LRLAPGPGHG WVGVGVAFKH APKRIAGTLA VAWGADHAEP WVLLTDLPPA QVDAAWYALR SWDEAGFRQS KSMGWDWQRG QVTDPDAVAW QYLVVATVTL WTVAVGTRIE DAEQQGVPPG RLKRAPPTTG APPRRRWSGT AQRVISLLRR GMQHLRWLLA QGRCWVRLWL RPEPLPKIGD SVTMHIYDPS QCLKSP
|
| |