Gene RoseRS_4489 details


Gene Information

Locus tag: RoseRS_4489
Symbol:
ID: 5211474
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus sp. RS-1
Kingdom: Bacteria
Replicon accession: NC_009523
Strand:
Start bp: 5628318
End bp: 5629322
Gene Length: 1005 bp
Protein Length: 334 aa
Translation table: 11
GC content: 61%
IMG OID: 640598068
Product: periplasmic binding protein/LacI transcriptional regulator
Protein accession: YP_001278771
Protein GI: 148658566
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1879] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
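
The Start bp, End bp, Gene Length and Protein Length fields are related by simple arithmetic. The sketch below reproduces that relationship, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single untranslated stop codon. Both are assumptions that the record does not state, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(5628318, 5629322)
    print(length)                  # 1005
    print(protein_length(length))  # 334
```

Under these assumptions the coordinates give 1005 bp and 334 aa, which agrees with the values listed in the record.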


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.311346
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000667621
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
ATGCATGTCA GGATGGTTCA ACTGCGGCTG ATGCTGCTGG TGCTGGCGCT GCCCGTTATG 
CTTGCCGCCT GCACACAACC GACATCCTCG CCAGGCGGAG CCGGCGGCGA TGCTCCCCGC
CGGTGGCGTA TCGGCATGTC GCAGGCGAAC AACGCCGAAC CGTGGCACCA GGCGATGAAT
GCGCAGATCG CTGCGGCTGC CGCCCGTCAT CCCAACATTG AGATCGAGTT CACTGATGCG
CGCCAGAACA ATGCACAGCA GATTGCAGAC GTGGAAGCCT TCCTGAGCAA GGGTATCGAT
CTCCTCATCA TTTCGCCCAA CGAAGCCTCC CCGCTGACCC CGATTGTAGC GAGCGCATTT
CAGCGCGGCA TTCCGGTGAT CGTGCTGGAT CGCAAAGTGA AGGGCGAACA GTACACGATG
TGGATCGGCG CCGATAACCG CCTGATCGGG CGCAAGGCGG GCGAATATAC GGCGCGCTGG
TGCCGCGAAC AGCAGCGATC ACCGTGTACG GTCATCGAAC TGCGTGGACT GGAAGGCTCG
ACCCCCACGC AGGAGCGCGG CGACGGCTTC CGCGAAGGGA TCGCCGCCAA CCCGGATGTG
CGCATTATTG CCAGTCAGAA CGCCGACTGG CTCGCCGAAC GGGCTGACGC GCTTGCGCGC
GTTCTCTTTG AAGCAAACCC GGATGTCGAT GTGGTCTATG CCCACAACGA TCCTATGGGC
ATTGCCGCCT TCAATGTTGC AAAGGAGCAG GGGCGCGATA CCGACGCCAT CCTCTTTATC
GGCATCGATG CGCTTGCGAC CTCCGATGGC GGTATTCAGG CGGTGCGGCA GGGCAAACTC
AATGTGACAT ACGTCTATCC TACGGGCGGC GCCGAAGCCA TCGAATGGGC GCTGCGAATA
CTGGAACAGC GCGAGACGCC GCCGCGCGAA ATCATTCTCG ATACTGAAGA AGTCACCACT
GCCAACGCCG ATGCCATGTT CCAGAAATAC GGAGGCCGGG AATGA
 
Protein sequence
MHVRMVQLRL MLLVLALPVM LAACTQPTSS PGGAGGDAPR RWRIGMSQAN NAEPWHQAMN 
AQIAAAAARH PNIEIEFTDA RQNNAQQIAD VEAFLSKGID LLIISPNEAS PLTPIVASAF
QRGIPVIVLD RKVKGEQYTM WIGADNRLIG RKAGEYTARW CREQQRSPCT VIELRGLEGS
TPTQERGDGF REGIAANPDV RIIASQNADW LAERADALAR VLFEANPDVD VVYAHNDPMG
IAAFNVAKEQ GRDTDAILFI GIDALATSDG GIQAVRQGKL NVTYVYPTGG AEAIEWALRI
LEQRETPPRE IILDTEEVTT ANADAMFQKY GGRE
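
The gene and protein sequences above are printed in blocks of 10 characters. The following minimal sketch shows one way to strip that spacing, compute GC content, and translate the coding sequence. It assumes the codon assignments of translation table 11 match the standard genetic code (the two tables differ in their permitted start codons, not in the amino acid assigned to each codon). The helper names `clean`, `gc_content` and `translate` are hypothetical and are not part of the source database.

```python
# Codon table built from the compact NCBI-style layout (base order T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq_text: str) -> str:
    """Remove the block spacing and line breaks used in the record."""
    return "".join(seq_text.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in an already cleaned sequence."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop."""
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 nt of the gene sequence listed above.
    first_blocks = clean("ATGCATGTCA GGATGGTTCA ACTGCGGCTG")
    print(translate(first_blocks))  # MHVRMVQLRL
```

Passing the full gene sequence through `clean` and then `gc_content` or `translate` gives values that can be compared with the GC content and protein sequence in the record.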