Gene RoseRS_2896 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_2896 
Symbol 
ID5209865 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp3614809 
End bp3616167 
Gene Length1359 bp 
Protein Length452 aa 
Translation table11 
GC content61% 
IMG OID640596492 
Productextracellular solute-binding protein 
Protein accessionYP_001277214 
Protein GI148657009 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.034507 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCACCA AACTCTCACG ACGCAGGTTC CTCAAGGTTG CTGCCGCGGG CGCAGGCGGC 
ATTGGTGCAA CGGCGCTGCT GGCGGCTTGT GGCGGCGCAG CGCCTCAGGG CGGTCAACCA
ACGGGCGGGC AGGCGCAACC TGCAGCGCCG GTTCAGGGTG ACACAGTTGT CACGGAAATC
ACCTTCTGGT GGTGGGATCA GGTCGGTGAG GTCTGGAAAG AACCGTTCGA GAAGGCGCAC
CCCAACATCA AACTCAACTT CGTCAACACC CCCTTCGCCG ATGCGCACGA CAAACTTCTG
ACCTCGTTCG CCGCCGGAAG TGGCGCTCCC GATGTCGCCT CGATCGAGAT TGGTCGCGTC
GGCAATTTTA CCGCCAAGGG CGGGCTTGCC GATCTGCTGG CGCCGCCGTT CGATGCGGGC
AGTCTCAAGA ATGATATGGT CGCCTATAAA TGGACCCAGG GATCGACTGC CGATGGTCGC
CTGGTCTGCC TGCCGTGGGA TATCGGTCCT GCTGGGGTCT GGTATCGCAC GGATATTTTC
GAGGCGCTCG GTTTGCCAAC CGAACCGGAA GCGGTAGAGG AGTTGATCGG CGGTCCGAAC
CGCACGTGGG ACGATTTCTT CGCCTTTGCC AAACAACTCA AGGAAAAGAG CGGCGGGAAG
ACGTCCCTCT TTGCCGATGC CGGCACTGAT ATTTATGGCG CCGTCTATCG CCAGCAGGGT
GAGGGGTATG CCGATGGCAA CAAAGTGCTG ATCGAAGAGA AGGCGACCCG TCCGTTCCAG
CTCGCGGCGC GCGCCCGCAA GGAGGGGATC GATGCCAACA TTCCCTGGTG GGGCGCCGAG
TGGCAGACCG GCTTGAAGGA CAATGCCTTT GCCGGAATGG TGATTGCCTG CTGGATGCAG
GGCGGTCTGA CACGCGAGCA GCCCGATCTG GTCGGGAAAT GGCGTGTCAT ACGCGCTCCA
GAAGCCAATT ACAACTGGGG CGGTTCGTTC ATGGCGATCC CGGAGCAGAG CAAGAACAAG
GAGGCGGCCT GGACGTTCGT CAAGTGGGCA TGCGCAACGG CGGAAGGGCA GAACATCATG
TTCAAGGCGT CCGGCGTGTT TCCCGCATAC AAGCCAGCCT GGCAGGATCC ACTCTACGAC
GAACCGGTGC CGTTCTTCGG CGGTCAGCGC GCCTATCGCT TGTGGACCGA AATCGGTGAC
AATATCAAAG CTATCTTCCG TACACCGAAC GATCTCCAGC TCGATGACAT CGTTGGCGCA
GAACTGACGA AGGTCTTGCA GGATGGCAAG GACCCCGTTC AGGCTGCGAA GGACGCCGAA
GCAGAAGCGC TCAGGCGCAT CCCCGATCTG CAAGGATAG
 
Protein sequence
MTTKLSRRRF LKVAAAGAGG IGATALLAAC GGAAPQGGQP TGGQAQPAAP VQGDTVVTEI 
TFWWWDQVGE VWKEPFEKAH PNIKLNFVNT PFADAHDKLL TSFAAGSGAP DVASIEIGRV
GNFTAKGGLA DLLAPPFDAG SLKNDMVAYK WTQGSTADGR LVCLPWDIGP AGVWYRTDIF
EALGLPTEPE AVEELIGGPN RTWDDFFAFA KQLKEKSGGK TSLFADAGTD IYGAVYRQQG
EGYADGNKVL IEEKATRPFQ LAARARKEGI DANIPWWGAE WQTGLKDNAF AGMVIACWMQ
GGLTREQPDL VGKWRVIRAP EANYNWGGSF MAIPEQSKNK EAAWTFVKWA CATAEGQNIM
FKASGVFPAY KPAWQDPLYD EPVPFFGGQR AYRLWTEIGD NIKAIFRTPN DLQLDDIVGA
ELTKVLQDGK DPVQAAKDAE AEALRRIPDL QG