Gene RoseRS_0725 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_0725 
Symbol 
ID5207664 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp892657 
End bp894327 
Gene Length1671 bp 
Protein Length556 aa 
Translation table11 
GC content62% 
IMG OID640594339 
Productextracellular solute-binding protein 
Protein accessionYP_001275091 
Protein GI148654886 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCACGTC GCATCCGCTG GCAAATCCTG ATTGCGTCGA TCAGTGCTCT GACGGTGCTG 
CTCCTGATGA GTTACCTGGC GCTCACACGC GCATCGGTGG CGCGCCCGCT GGCAGGAGGC
GATTATATCG AAGGGGTTGT CGGCGCGCCG GTGCATCTCA ATCCCCTGGT CGCTGATCCA
GCCTCCGATC CGGTCGCCGC CGACATCCAG CGCCTGGTCT TCGAAGGGTT GACCCGCCCC
GGTCCCGACG GTCTACCCAT GCCGGCACTC GCCGAGTCGT GGGCGGTTGA TGAGAGCGGG
ACAGTCTACA CCTTTACACT GCGAAGCGGC GCGTCGTGGC ACGATGGTGC GCCGGTGACG
GTCGATGACG TGCTGTTCAC GCTGCGCGCT GTGCAGGGTC CGGCGTTCGC CGGTGATCAG
AACGTTGCCG CTTTCTGGCG TACTGTCCTG GTTGATCGCG TCGGAGAACG CAGCGTCAGT
TTCCGCTTGG AAGCGCCATT TGCGCCATTT CTGCGGTTGA CCGGTTTCCC GATCCTCCCC
GCGCACCTGC TGCGTAACGT TCCGCCGGAA CAGTGGGAGG CGCATCCATT CAACCGCTTG
CCGGTCGGCG CCGGTCCGTA CCGCCTGGTG GAACTCGATG AACAGCGTGC GCTGCTACGC
GCCAACCCGC GTTATTTCGG CGCAACCCCG TTCATCGAAA CGATTGAACT GCGCTTTTTT
CGCACTGAAC AGGAAGCATT CGCTGCGCTG ACCCGCAGCG AGATTCAGGG TCTGGCATTC
ACCGGCGCCA GCGCCCTGGC GGATGTCAAC CTGCCACGCG GCATTGTGCG ACGTCAGGCG
CTGCTGGATG GATACACGGC GCTCTCCTTC AACCTGCGCG ACGGTCTGCT CACCGATCTC
GGCGTGCGAC GCGCACTGGC GACTGCACTC GACAAGGATG CCCTGATCGC CAGTGCGCTT
GCGGGGAAGG TGATGCGGCT CGATACGCCG ATTCTGCATG GATGGTGGGC GGAAACGTCC
GATGTGTCGT GGTATGAGCC AGACGTTGCC CGCGCAATGG CGCAACTTGA CACGCTGGGG
TACGTCCCGG GCGCCGATGG CGTCCGTGTT CGGGACGGTC AACCACTCGT CTTTTCGCTG
CTGACCGATA ACTCTCCAAC GCGGCGTGCG GTTGCAGAAG AGATTGCCCG TCAGTGGAGC
GCCATCGGGG TGCAGATCGT TATCGAACCG GTCGAACCGA CCGAAATGCA ACGCCGGCTG
GAAACGCATG AATTCACCAT CGCACTGCAC GGATGGCAGC GGCTCGGTTC CGATCCCGAT
GTCTTCGAAC TCTGGCATTC GAGTCAGGCG GAGCGCGGAC GCAATTACGC AGGTCTCGCA
GATGCCACTA TCGACGAGAT TCTCTCCAGC GCGCGCAAAA TCTACGACAT CACGGATCGT
GCCGAACTCT ACCGTGAATT TCAGGAACGC TGGGTCGAAC TGGCGCCGGG GATCATTCTC
TATCAACCGA TCCTGTTCCA CGCCACAGTT GCCGACCTCG GCGACACGAT TGCCGTCCCG
CCGGATGCCG CCGCTTCGCC CCATCTGCTG ATCGGGCGCG AGGGGCGTTT TGTGAATGTC
AACCGCTGGT ATCTGCGTAG CGCTCGCGAG ATCCGCGGTG ATTTGCGATA A
 
Protein sequence
MARRIRWQIL IASISALTVL LLMSYLALTR ASVARPLAGG DYIEGVVGAP VHLNPLVADP 
ASDPVAADIQ RLVFEGLTRP GPDGLPMPAL AESWAVDESG TVYTFTLRSG ASWHDGAPVT
VDDVLFTLRA VQGPAFAGDQ NVAAFWRTVL VDRVGERSVS FRLEAPFAPF LRLTGFPILP
AHLLRNVPPE QWEAHPFNRL PVGAGPYRLV ELDEQRALLR ANPRYFGATP FIETIELRFF
RTEQEAFAAL TRSEIQGLAF TGASALADVN LPRGIVRRQA LLDGYTALSF NLRDGLLTDL
GVRRALATAL DKDALIASAL AGKVMRLDTP ILHGWWAETS DVSWYEPDVA RAMAQLDTLG
YVPGADGVRV RDGQPLVFSL LTDNSPTRRA VAEEIARQWS AIGVQIVIEP VEPTEMQRRL
ETHEFTIALH GWQRLGSDPD VFELWHSSQA ERGRNYAGLA DATIDEILSS ARKIYDITDR
AELYREFQER WVELAPGIIL YQPILFHATV ADLGDTIAVP PDAAASPHLL IGREGRFVNV
NRWYLRSARE IRGDLR