Gene RoseRS_2808 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_2808 
Symbol 
ID5209777 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp3499400 
End bp3501583 
Gene Length2184 bp 
Protein Length727 aa 
Translation table11 
GC content62% 
IMG OID640596407 
Productextracellular solute-binding protein 
Protein accessionYP_001277129 
Protein GI148656924 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCAACG CACGGAAGCT CAACCGGCGC ACATTCCTGC GCCTGTCAGC CGTCACGGCG 
GCCAGCGCCG CTATCGCAGC CTGTGGCGGC CAGCCTGCCG CACCTCAACC GACCACTGCG
CCTGCCGCGC CTCAACCGAC CACTGCGCCC GCCGCGCCCG CTCCGACCAC TCCTCCCGCA
GCGACTGCCG TTCCTCCGGT GACAACCAAG TTCAAAGAAG CGCCGATGCT GGCGAAACTG
GTGCAGGAGG GCAAGTTGCC GCCGGTGGAT GAGCGCCTGC CGAAGAACCC CTACACGCCG
CCCCACTCCT GGCTTACCGT CGGCAAGTAT GGCGGTGTGC TGAAGAAGAC CTACAACAAC
AACTGGGGTA TTACCGGTTT CATCCACGAG ATGCAGTATG GCTCGTCGCC GCTGCGCTGG
CTCAAGGATG GACTGGCGAT CGGTCCCGGT TTTGTTGAAA GCTGGGAGTC GAATGCGGAC
GCGAGCAAGT GGACGTTCAA GATCCGCGAG GGGATCAAGT GGAGCGATGG TCAGCCGTTC
ACCACAAAGG ACATCATGTA CTGGTGGGAG TATACGGTTG GCGGTAACGG CAAGGAGAAG
GAGTTCCCCG CCGGCCTCAA GCCGATCAAC GCCCCGCCGG ATGAAGCGCG ATCCGGCACC
GGCACGCTGA TGACCCTCAA CGCACCGGAT GACTATACCT TCGAGATGGT CTTCGATGCA
CCGGCGCCAC TCACGGCGGA CCGCCTGGCG ATGTGGGTCA ATATGTTCAT CGGTCCGGCA
TGGGTGATGC CGCGCCACTA TATGGAACAG TTCAACCCGG TGCTCAACCC CGATAAGTAC
AAGGACTGGG AAGAGCATCA GCGCAAGTTC AACCACAACA ACCCCGACTG CCCGCGTCTG
ACCGGCTGGA AACTGGATGT GTTCGAGGAA GGCGTCCGCG CTGTCTGGTC GCGCAACCCC
TACTATTGGG CAGTCGATAA GGAAGGCAAT CAGTTGCCGT ATATCGATCA GATCATTGTG
ACGGCGGTCA AGGACAAGGA GATCGAGAAA CTGGCGTACA CGGAGGGACG CGCCGACCAC
GCGCACTTCC ACGGTCAGGG TCTGGCAGAT GTCCAGTCGC TACGCGATGC CGAGTCCAAG
AGCCAGCTCG AAGTTCGCTT CTGGGACTCT GGATCGGGCA CCGGTTCGCT CTACTTCTTC
AACATGGACT TCAAAGACCC GAAGATGCGC GCGGTGTTCC GCGATCCGAA GTTCCGCCAG
GCGCTGTCGC ACGCCTACAA CCGCGCCGAT GTGCAGAAGG CGGTCTACTT CGGGCTTGGC
GAGTTGACCA CCGGCACCTT CAGCCCGAAG GCGATCGAGT ACAACATCAA CGATCAGGGC
AAGCAGGTGT ACGCTGCATG GCGCGATAGC TACGTCAAGT ACGATCCGGC GCTGGCGGAG
AAGATTCTGG ATGAAGCAGG CTACAAGAAG GGTCCCGACG GCAAGCGCAC CATGCCGGAT
GGCAGCCCGC TGCAGATCCA GGTGACCTAT GGCGCCAACG CAACGCCTGG CGGCGAGCAC
CTGTCGAAGA ACGAGCGTCT GGTGCGCGAC TGGCAGGCGA TCGGCATTGA TGCAGTGCTG
ACGCCCATTC CAGGTGAGGG CGCCGATGAG AAATGGCGCG CTGGTGAAAT TCCGATGAAG
ACAGAGTGGG AAGTTGGCGA CGGCCCGAAC CACCTGGTCT TCCCATCGTG GCTGGTGGCG
GACGAGACCG AGCGCTGGGC GCCGCTGCAC GGGCGCGGGT ACACACTGCG CGGCACGGCG
TCGGAGAAGG AAGAACTGGA CAAGAATCCG TGGGATCGCA ACCCGCCGCG TATCAATCCC
ACCGATCCGG ATTATATGCC GGCGATTAAG AAACTCCACG ACCTGTTCGA CAAGAGCAAG
GTGGAACCGG ATGCCATGAA GCGCCACCAG CTCGTGTGGG ACATGATCAA GGTCCACATC
GAGGAGGGGC CGTTCTTCAC CGGAACGATC GCCAACCCGC CGCGCATTAT CCTGGTCAAG
AAGGGCCTGA TGAACGTACC AACCCGCGAT GACCTGTTGA AGGAAGGGTT GGGTGGCTTC
GTCAACCCGT GGATCATCCC GTCGCCGGCG GTCTATGACC CGGAGACCTG GTACTGGGAT
AACCCCGACG CGCACCAGGC GTAG
 
Protein sequence
MSNARKLNRR TFLRLSAVTA ASAAIAACGG QPAAPQPTTA PAAPQPTTAP AAPAPTTPPA 
ATAVPPVTTK FKEAPMLAKL VQEGKLPPVD ERLPKNPYTP PHSWLTVGKY GGVLKKTYNN
NWGITGFIHE MQYGSSPLRW LKDGLAIGPG FVESWESNAD ASKWTFKIRE GIKWSDGQPF
TTKDIMYWWE YTVGGNGKEK EFPAGLKPIN APPDEARSGT GTLMTLNAPD DYTFEMVFDA
PAPLTADRLA MWVNMFIGPA WVMPRHYMEQ FNPVLNPDKY KDWEEHQRKF NHNNPDCPRL
TGWKLDVFEE GVRAVWSRNP YYWAVDKEGN QLPYIDQIIV TAVKDKEIEK LAYTEGRADH
AHFHGQGLAD VQSLRDAESK SQLEVRFWDS GSGTGSLYFF NMDFKDPKMR AVFRDPKFRQ
ALSHAYNRAD VQKAVYFGLG ELTTGTFSPK AIEYNINDQG KQVYAAWRDS YVKYDPALAE
KILDEAGYKK GPDGKRTMPD GSPLQIQVTY GANATPGGEH LSKNERLVRD WQAIGIDAVL
TPIPGEGADE KWRAGEIPMK TEWEVGDGPN HLVFPSWLVA DETERWAPLH GRGYTLRGTA
SEKEELDKNP WDRNPPRINP TDPDYMPAIK KLHDLFDKSK VEPDAMKRHQ LVWDMIKVHI
EEGPFFTGTI ANPPRIILVK KGLMNVPTRD DLLKEGLGGF VNPWIIPSPA VYDPETWYWD
NPDAHQA