Gene RoseRS_2159 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_2159 
Symbol 
ID5209121 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp2657225 
End bp2658922 
Gene Length1698 bp 
Protein Length565 aa 
Translation table11 
GC content59% 
IMG OID640595760 
Producthypothetical protein 
Protein accessionYP_001276489 
Protein GI148656284 
COG category[S] Function unknown 
COG ID[COG5267] Uncharacterized protein conserved in bacteria 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGTTTT CACGGCGTCG CTTCCTTGGC GCCAGCGCAA CCATGGCAGG CGCAGGGATA 
GTTGATTTTC CCGGAAAACA CACCAGCAGC ATCACCTTTC GGACACATTC GCAGGCAGAT
CCAGCGGCAA ACCCTCCCAT CGAACTGATC GCGCTCAACC GAATGGCGTA CGGACCTCGA
CCCGGCGATG TCGCGCGTGT CCAACAGATG GGATTGACCG CATATGTTGA TGAACAACTC
AACCCCAACG ATGCGGACGA TGCGCTATGT GCATCGAAAC TTGTCAGTGC ACGTCTGCGC
ATCCAGTACG ACGCTGGCAT GGGCTATCCG GCAGTCGATG AAATGCGCTC GCTGCGCACG
ATCATCGAAA ACTGGGGACT GGCACAACTC TGGCCCCTCA CAAAGCATCC AGCTTATCAG
GAGCGCATCC GTCCCGTTGA GGAGGTGCGC GCTGCCACGC TGATCCGCGC CGTCTACAGC
AAATGGCAGC TGCGTGAGGT GCTGGTCGAG TTCTGGCACA ACCACTTCAA TGTCGATGCC
TACTCTGACA CCCGCATTAG CGCAACCTGG CCCCTGTACG ACCGTGATGT CATCCGGCGG
CACTGTCTGG GCAATTTTCG GAACATGGTG CGGGATGTGG CGAAAAGCAT CGCCATGGGG
TTCTACCTTG ATAACGCCTT CAGCCGCGAC GGTCCAGCGA ATGAGAATTA TGCCCGCGAA
CTGTTCGAGT TGCATACGCT GGGGCAGGAG AACTATCTCA ACCACCTGTA CAACCGCTGG
CGCGATGTTC CCGGCGCGCT TGAAGGAAAT CCGATCGGCT ACATCGATCA GGATGTGTAC
GAGGCGGCGC GTGCATTTAC CGGATGGACG ATTGCCCACG GTCAGACGAT CAGCGGCAGT
CTGCGTCTGC CGGATACGGG TGAGTTTGCG TATGTCGATC TCTGGCACGA CAACGCACAA
AAGAGAGTGC TGGCTTTTGA GATCGACCCC AACCGACCGC CACTTGCGGA CGGCGAGGAT
GTGATTCGCC TGGTCAGCCA GCATCCCGGC ACGGCGCGAT ACATCTGCCG CAAACTCTGC
CGTCGCCTGC TCGCCGACGA TCCGCCCGCC TCGCTGGTGA ACACGCTGGC GAACATCTGG
CTCACCAACA AGGACGCACC CGATCAGATC GCCAGAACCG TTCGCGCGCT CCTCCTGTCG
GACGAATTCG CCAGCACGTT CGGCAGAAAG GTGAAACGAC CGTTTGAAGT GGTGGTTTCA
TTCCTGCGTG CCACGAATGC GGAGGTCACG CCCAACCGCG ATCTCTTCTG GCAGTTGCAG
GAGATGGGGT ATCGGATGTT CAACTGGGGA CCGCCAACCG GACACCCGGA GGAGAGCGCG
GCGTGGCTGA GCACGAATGG CATGCTCCGG CGCTGGAATA TCATCAACCA GTTGCAGAGC
ACCTGGCTCA AGGCTGCAAC GTTCGATCTG CCAGGGCAGA CGCCACCAGG CTTGACGGCA
CGGCAGATCG TGGAGTTCTG GGTCGGGCGT CTGCTGGCTG TGCCACCTCC AACATCAACC
ATGAATCGCC TGATCGACCT GATGCGCCAG AACGGTTCGG CGGACGCCCC GCCGACCGGC
AGCACCGACG AGATTGTCGA CCGCATCAAG AGTGTTGTGA CGTTGATTGC GATGACGCCT
GAGTTTCAAC TGCGCTAG
 
Protein sequence
MTFSRRRFLG ASATMAGAGI VDFPGKHTSS ITFRTHSQAD PAANPPIELI ALNRMAYGPR 
PGDVARVQQM GLTAYVDEQL NPNDADDALC ASKLVSARLR IQYDAGMGYP AVDEMRSLRT
IIENWGLAQL WPLTKHPAYQ ERIRPVEEVR AATLIRAVYS KWQLREVLVE FWHNHFNVDA
YSDTRISATW PLYDRDVIRR HCLGNFRNMV RDVAKSIAMG FYLDNAFSRD GPANENYARE
LFELHTLGQE NYLNHLYNRW RDVPGALEGN PIGYIDQDVY EAARAFTGWT IAHGQTISGS
LRLPDTGEFA YVDLWHDNAQ KRVLAFEIDP NRPPLADGED VIRLVSQHPG TARYICRKLC
RRLLADDPPA SLVNTLANIW LTNKDAPDQI ARTVRALLLS DEFASTFGRK VKRPFEVVVS
FLRATNAEVT PNRDLFWQLQ EMGYRMFNWG PPTGHPEESA AWLSTNGMLR RWNIINQLQS
TWLKAATFDL PGQTPPGLTA RQIVEFWVGR LLAVPPPTST MNRLIDLMRQ NGSADAPPTG
STDEIVDRIK SVVTLIAMTP EFQLR