Gene RoseRS_4599 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_4599 
Symbol 
ID5211585 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp5763828 
End bp5765693 
Gene Length1866 bp 
Protein Length621 aa 
Translation table11 
GC content61% 
IMG OID640598178 
Productextracellular solute-binding protein 
Protein accessionYP_001278880 
Protein GI148658675 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.593059 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTTCGA TGCGTAACGT GTGGCGGAAG AGCGGTGCGC TGGCGCTGCT CCTCATCCTG 
ATCGTCCCGG TTCTGGCAGC GTGTGGCGGG CAGCCAGCCA CAGCACCGAC CACGGCGCCA
GCGGCGCAAC CGACGACTGC ACCCACTGAG GCGCCAGCGG CGCAACCGAC GACTGCGCCC
ACTGAGGCGC CAGCGGCGCA ACCGACGACT GCGCCGACCA CTCCGCCAGC GGCGCAGCGC
GGCGGTCGCC TGAAGATCCT GTACTGGCAG GCGGTGACGA CCCTCAACCC TCACCTGGCG
ACAGGTACGA AGGACTTCGA TGGCGCAACG GTTATCCTCG AACCGCTGGC GCGCTACAAC
GAGAAGGATG AACTGGTGCC CTTCCTGGCG GCGGAGATTC CAACGATCGA GAACGGCGGC
GTCGCTCCTG ATGGAACCAG CGTCACGTGG AAGCTCAAAC CAGGGCTGAA GTGGTCGGAT
GGCAGCGATT TCACGGTTGA CGACATTATC TTCACCTGGC AGTACTGCGC CGATCCTGCG
ACTGCCTGCA CCACCAAGGC GGTCTTCGAC CCGATCGAGA AGGTCGAGAA GGTCGATGAT
ACCACTGTCA AGATCATCTG GAAGGAGCCG AACGCCAACC CGTACATCTC GTTCGTCGGT
CCGAACGGCA TGATCCTGCA GAAGAAGCAG TTCGAGAAGT GCATCGGCGC GGCAGCCAGC
ACCGATGCGG CGTGCCAGGC GGCGAACCTC GCGCCGATCG GTACAAATGC CTGGAAGTTG
AAGGAGTTCA AGCCGGGCGA TGTGGTGATC TACGAACGCA ACCCCTTCTT CCGCGATGCC
GATAATGTCT TCTTCGACGA AGTGGAGATC AAGGGCGGCG GTGATGCTGC GTCAGCAGCG
CGCGCCGTGT GTGAGACGGA AGAGGTCGAT TTCGCCTGGA ATCTCCAGAT CCCGAAAGCG
GTTCTCGAGC CGATCCTTGC TTCCGGTAAG TGCGACCCGA TCGCCGGCGG TTCGTCCGGA
GTTGAGCGCA TTGTGGTCAA CTTCTCGAAC CCCGACCCGG CACTGGGCGA CAAGCGCAGC
GAGCCGGATC AACCCCACCC CTTCCTGACC GACCCTGCAG TGCGCAAGGC GATCTCGCTG
GCGATTGATC GCAAGGCAAT CGCCGAGCAG TTGTACGGTC CTACCGGCGA ACCTACCTGC
AACGTGCTGG TGGTGCCGAC TGCGGTCAAT TCGCCAAACC TGACATGCAA CCGCGATGTC
GAAGCAGCGA AGAAGTTGCT CGAAGATGCG GGCTGGAAGT TGAAAGGCTC GGTGCGGGAG
AAGGAGATCG GGGGGAAAAC GGTCAGACTG GTCGTCAGTT TCCAGACCTC GATCAATCCG
CTGCGCCAGA GCACCCAGGC GATCATCAAG TCGAACCTGG CGGAGATCGG TATTCAGGTG
AACGTCAAAG CCATTGATGC CAGCGTCTTC TTCAGTGGCG ATCCAGGCAA CCCGGATACC
CTGAACAAGT TTTACGCCGA CCTCCAGATG TACACCAACG GTCCGAACAA CGCCGATCCA
CAGCAGTACC TCCAGGGCTG GACCTGCGAA GAGCGCGCTT CGGTCGCCAA TCAGTGGAAC
GGCAACAATG ACGGTCGCTA CTGCAACCCG GAGTACGACA AGCTTTTCGA GGAGTTGAAG
AAGGAACTTG ACCCGAAGAA GCGCGTCGAA CTGGCGATCA AGATGAACGA TCTGCTGGTG
ACCGATGGTG CGGTGATTCC GCTGATCAAC CGCCAGACGC CGAATGCGAA GGTGAAGGCA
CTCAAGGGAC CCACCTTCAA CACGTTCGAC TCCGACATCT GGAATATTGC CTCCTGGAGC
AAGTAA
 
Protein sequence
MRSMRNVWRK SGALALLLIL IVPVLAACGG QPATAPTTAP AAQPTTAPTE APAAQPTTAP 
TEAPAAQPTT APTTPPAAQR GGRLKILYWQ AVTTLNPHLA TGTKDFDGAT VILEPLARYN
EKDELVPFLA AEIPTIENGG VAPDGTSVTW KLKPGLKWSD GSDFTVDDII FTWQYCADPA
TACTTKAVFD PIEKVEKVDD TTVKIIWKEP NANPYISFVG PNGMILQKKQ FEKCIGAAAS
TDAACQAANL APIGTNAWKL KEFKPGDVVI YERNPFFRDA DNVFFDEVEI KGGGDAASAA
RAVCETEEVD FAWNLQIPKA VLEPILASGK CDPIAGGSSG VERIVVNFSN PDPALGDKRS
EPDQPHPFLT DPAVRKAISL AIDRKAIAEQ LYGPTGEPTC NVLVVPTAVN SPNLTCNRDV
EAAKKLLEDA GWKLKGSVRE KEIGGKTVRL VVSFQTSINP LRQSTQAIIK SNLAEIGIQV
NVKAIDASVF FSGDPGNPDT LNKFYADLQM YTNGPNNADP QQYLQGWTCE ERASVANQWN
GNNDGRYCNP EYDKLFEELK KELDPKKRVE LAIKMNDLLV TDGAVIPLIN RQTPNAKVKA
LKGPTFNTFD SDIWNIASWS K