Gene RoseRS_3158 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_3158 
Symbol 
ID5210128 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp3978064 
End bp3979314 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content71% 
IMG OID640596749 
Producttransposase, IS4 family protein 
Protein accessionYP_001277469 
Protein GI148657264 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCCGCA CGTCCACAGA ACGGTTGCTG CAGCAGCAGA TCCAGACCCT CTTCCCGCGC 
TTGTCGCGTC ACCGGCGCCG CGCCCTGGCC CGCTGGGTCT TGGGCGCCTT ATTGGCGGGG
AGCGCCAATC GTCCCGCGCT GGTGCACGCG CTCGCCACTG CCGGGATCGC CCGCGCCGCC
ACCCTGGCGG ACGCCTGGGA TGCCTGGATC GCCGCCCCGG CCCATCGCAT TGACACCGCC
GACCCACCCG CAGCGGGTGC GCCACCGGTG GTCAGTCCGC TGGCGTGCGG CGCCGACCTG
CTCCGCTGGA TTCGCGCGCA CTGGACCGGC GGACCGCTGG TGCTTGGGCT GGATGCCTCC
CATCGGCGCG ATGACGTCGT TCTGCTGCGC ATGAGCGTCC TCTATCGGGG CACCGCCCTG
CCGGTCGCCT GGGTGATCGT CCCGGCGAAC CAACCGGGTG CGTGGGAACC GCACTGGGAG
CGGATGCTGC GCTGGGCCCG CAGCGCGCTG CCGCTCGACC AGGAGGTCCT CGTGCTGGCG
GATCAGGGGT TGTGGAGCCC CCGGCTGTGG CACGCCATCC GGTCGCAGCA GTTCCATCCC
ATCATGCGGG TGCGCACCAC GTCGACCTTC GCGCCGACCG GTCAGGCGCG CCAGTCGGTG
CTGCGCCTGG CGCCCGGACC GGGGCATGGA TGGGTGGGCG TGGGGGTCGC CTTCAAGCAC
GCACCCAAGC GGATTGCGGG CACGCTGGCG GTGGCGTGGG GCGCCGACCA TGCGGAACCG
TGGGTGCTGC TGACCGATCT GCCGCCCGCG CAGGTGGATG CCGCGTGGTA TGCCCTGCGC
AGTTGGGATG AGGCGGGCTT CCGCCAAAGT AAGTCGATGG GCTGGGACTG GCAACGCGGT
CAGGTGACGG ACCCGGATGC AGTCGCCTGG CAGTATCTGG TAGTGGCGAC GGTCACGCTG
TGGACGGTGG CCGTCGGCAC GCGGATCGAA GATGCAGAAC AGCAGGGGGT TCCGCCCGGT
CGCCTGAAGC GGGCGCCGCC GACGACCGGC GCGCCGCCGC GCCGTCGCTG GAGCGGCACG
GCGCAGCGGG TGATCAGCCT GCTCCGGCGG GGGATGCAGC ACCTGCGCTG GTTGCTGGCG
CAAGGGCGTT GTTGGGTCCG CTTGTGGTTG CGCCCGGAGC CCTTGCCCAA AATAGGTGAC
AGCGTAACCA TGCATATCTA TGACCCGTCC CAATGCCTGA AATCGCCCTA A
 
Protein sequence
MSRTSTERLL QQQIQTLFPR LSRHRRRALA RWVLGALLAG SANRPALVHA LATAGIARAA 
TLADAWDAWI AAPAHRIDTA DPPAAGAPPV VSPLACGADL LRWIRAHWTG GPLVLGLDAS
HRRDDVVLLR MSVLYRGTAL PVAWVIVPAN QPGAWEPHWE RMLRWARSAL PLDQEVLVLA
DQGLWSPRLW HAIRSQQFHP IMRVRTTSTF APTGQARQSV LRLAPGPGHG WVGVGVAFKH
APKRIAGTLA VAWGADHAEP WVLLTDLPPA QVDAAWYALR SWDEAGFRQS KSMGWDWQRG
QVTDPDAVAW QYLVVATVTL WTVAVGTRIE DAEQQGVPPG RLKRAPPTTG APPRRRWSGT
AQRVISLLRR GMQHLRWLLA QGRCWVRLWL RPEPLPKIGD SVTMHIYDPS QCLKSP