Gene RoseRS_3050 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_3050 
Symbol 
ID5210018 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp3828692 
End bp3830965 
Gene Length2274 bp 
Protein Length757 aa 
Translation table11 
GC content59% 
IMG OID640596642 
Productextracellular solute-binding protein 
Protein accessionYP_001277364 
Protein GI148657159 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0190244 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.13375 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTTTCC GAAACAGATG TTCACCACGT TCCTCACACT CTGACATCAG CTACGAGGAG 
GTACCTGTGT CTAGAACACG CACACTGAGC CGACGACGAT TCCTGACGCT TTCGGCGATG
ACTGCTGCCA GTGCGGCGAT TGCTGCGTGT GGCGGCGGTC AGCCTGCTCA GGCGCCAACG
TCTGCACCGG CGCCGACATC TGCACCGGCG CCGACATCTG CTCCCGCGCC AACTGTTGCC
ATCCCGCCCA CCACGCCGCC CCAGGCGACT GCTGTGCCCC AGGCGGTCAG CAGGTTCAAA
GAGTCGCCCG AACTGGCAAA ACTGGTCGCC GAAGGGAAAC TGCCGCCCGT TGATCAGCGT
CTGCCGAAGA ACCCTTACGT TGTGCCGCAC AAATGGCTCA GTGTCGGCAC GTATGGCGGG
ACGCTCAACT TCACCAACTC GTGGGGACCC GATGGTATGG CGACGATTGT GCAGGAGAGC
CAGTACGGTC ACTCGATCCT GCGCTGGCTT GACGATGGCT TGAAGATTGG TCCGGGGCTT
GCTGAAAGTT GGGAAGCGAA TGCCGATGCC AGCGAGTGGA CGTTCAAGTT CCGCGAGGGG
CTCAAGTGGT CTGACGGGCA TCCCTGGACG GTCGATGACA TTCTGTACTG GTGGGAGTAT
ATGGTCGGCG GCAACGGGAA GGAGAAAGAG TTTCCGGAGG GGTTGAAGCC GATCGAGCCG
CCGCCGGACG AGGGACGTTC CGGTACTGGA ACGCTGGCGA CTCTCATCAA AGTCGATGAC
TACACGCTGA CGATGAAGTT CGATGCGCCA GCGCCGCTGA CTGCTGATCG CCTGGCGATG
TGGGTCAACG CAGGCATCGG ACCGCGCTGG ATGGCGCCGC GGCATCATAT GGAGCAGTTC
AACCCGGTGC TCAACCCTGC AAAATACAAA GACTGGGACG AACATACCAA GCGCATCCGC
TTCGTCACCA ACCCCGATTG CCCGACGATG ACGGGATGGA AGTGTGAAAG TTTTGAGGAG
GGCGTGCGTG GCGTTTATAC CCGCAACCCG TATTACTGGT GTGTCGACGC TGAGGGGAAT
CAACTGCCAT ACATCGACCG CATCGTTTCG ACCACCTTCC AGAACAGCGA AGTCGAGAAG
TTGAATGTCA TCCAGGGGAA GAGCGACTTC TCGCACCACT GGGTGCTCAG TCTCGATGAT
GTGCCCAACC TGCGCCAGAA TGCTGAAGCA GGCGGGTATG AAGTGCGGTT CTGGGAGAGC
GGATCCGGGT CGGGAACCTC GTTCTTCTTC AGTTACGACT ACAACGATCC GAAGTGGCGG
GAATTGATCC GCAATCCGAA GTTCCGGCAG GCGCTCTCGC TGGCGTTCAA CCGCGCTGAA
CTGCAAAAAA CGCAGTACTT CGGCACCGGT GAGTTGACAA CCGGCACCTT CAGCCCGAAA
GCCATCGAGT ACAATATCGA CGAAAAGGGG AACCCGGACC GGGCAAACGC GCGGTACCGC
GAATGGCGCG ATAGTTACGT GGCGTTCGAT CAGGAACGGG CAAAGAAACT GCTCGACGAC
ATTGGGGTCA AGGTTGGTCC CAACGGTTTC CGTACCTTCC CCGACGGTTC ACCGCTGGAG
ATTTTGCTGG TCGTCGCCGC CAACACCAGC AAGAACACGA TTGCTCAGAA TGAGCAGATG
ATCCGCGACT GGGGGCAGAT CGGCATCAAG GCGACGCTGA CCCCGGTGCC GCCGCAGGGA
CGCCGTGAGG ACTGGTTCGC CGGCAAACTG ATGTCGAACG CCGACTGGGG CATCGGTGAT
GGTCCGAACC ATCTGGTGTA TCCGCAGTGG CTGGTGCCGA TTGAGCCGGA ACGCTGGGCG
CCGCTTCACG GACAGGGGTA TCAGTTGCGC GGTACGGCAA GCGAGAAGGA AGAACTGGAC
AAAGATCCGT GGCAACGCGC GCCTGCCCGA ATCGTGCCGA CCGATAAGGA TTTCGACCCG
ACCATCGCAA AACTGCACGA GATCTACGAT AAGACGAAAG TCGAACCGGA TTTCCTGAAG
CGCACACAGA TGGTCTGGGA GATGATCAAG ATCCACGTCG AGAACGGTCC GTTCGTGCAG
GGGTCGGTAG CGAACTTTGA CCGCGTGTTC ATCGTCAAGA AGGGCTTGAT GAACGTTCCC
CGGAAAGAAG ACCTTGCGCT CGGCGGCTTC ACCGATCCCT GGATCCATCC AACGCCGGCG
GTGTACGATC CTGAAGCATG GTATTGGGAC GATCCGGCGA AGCATAGCGG TTGA
 
Protein sequence
MFFRNRCSPR SSHSDISYEE VPVSRTRTLS RRRFLTLSAM TAASAAIAAC GGGQPAQAPT 
SAPAPTSAPA PTSAPAPTVA IPPTTPPQAT AVPQAVSRFK ESPELAKLVA EGKLPPVDQR
LPKNPYVVPH KWLSVGTYGG TLNFTNSWGP DGMATIVQES QYGHSILRWL DDGLKIGPGL
AESWEANADA SEWTFKFREG LKWSDGHPWT VDDILYWWEY MVGGNGKEKE FPEGLKPIEP
PPDEGRSGTG TLATLIKVDD YTLTMKFDAP APLTADRLAM WVNAGIGPRW MAPRHHMEQF
NPVLNPAKYK DWDEHTKRIR FVTNPDCPTM TGWKCESFEE GVRGVYTRNP YYWCVDAEGN
QLPYIDRIVS TTFQNSEVEK LNVIQGKSDF SHHWVLSLDD VPNLRQNAEA GGYEVRFWES
GSGSGTSFFF SYDYNDPKWR ELIRNPKFRQ ALSLAFNRAE LQKTQYFGTG ELTTGTFSPK
AIEYNIDEKG NPDRANARYR EWRDSYVAFD QERAKKLLDD IGVKVGPNGF RTFPDGSPLE
ILLVVAANTS KNTIAQNEQM IRDWGQIGIK ATLTPVPPQG RREDWFAGKL MSNADWGIGD
GPNHLVYPQW LVPIEPERWA PLHGQGYQLR GTASEKEELD KDPWQRAPAR IVPTDKDFDP
TIAKLHEIYD KTKVEPDFLK RTQMVWEMIK IHVENGPFVQ GSVANFDRVF IVKKGLMNVP
RKEDLALGGF TDPWIHPTPA VYDPEAWYWD DPAKHSG