Gene RoseRS_3619 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_3619 
Symbol 
ID5210597 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp4522488 
End bp4523954 
Gene Length1467 bp 
Protein Length488 aa 
Translation table11 
GC content58% 
IMG OID640597212 
Productextracellular solute-binding protein 
Protein accessionYP_001277924 
Protein GI148657719 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGACTGA GACAATACCC CTGGATTCTC TTGCTCGGTG CGTTGATGCT GGTGCTGGCT 
GCCTGTGGCG GACAACAGAC AGCTTCACCG ACCGCCGCGC CGGGGCAGGC GCCAACCACT
GCGCCAGTTA CCGACCTGCC GACTCCCACG CCGGTTCTTG AGTTTGCGCA GCAACCTCAA
CCCGGTCAGA AGGTGCTGGT ATGGATGGTG CGCATCAACG CCACCGAGAA CCGCTGGGAG
CGCGATGTCG TCCTTCCTGC CTACCAGCAG GTCGCCCCGG ATGTATTCGT AAAGGTGCTC
AATATCAACC AGGACGATAT TGCGGTCAAG CGTGAGGCAA TGATCGCCGC GAAGGAGCCG
CTGCACGTCT GGTCGTCCAA CTGGGGCGGT GATGGCTTCG CCAGCGACCG TTTCCGCGGG
CTGCTGGCAG ATCTGACCCC CTTGATCGAA CGTGATAAGT GGGACACCAG CGACTTCATT
CCTGAAGTGT TCGCCATCTA CAATGTCGAG GGCAAGCAGT ACGGCATTCC GTTCCTCACG
ACTGGCAGTT ATGTGTACTA CAACATGAAA CTGTTCGACG AGGCTGGCGT GCCCTATCCT
CCGAGCGACT GGAACGACAA GTCGTGGACA TGGGATGCGT TCCTCGAAAC CGCCAAAAAA
CTGACAAAGA ACCCGGATGA TCCGAGCACG GCTGTGTATG GCGGCGTCAA CGGGCTGTGG
CCGCCATTCG ATAGCATTCC CATGATCTGG GGGAAGGATC CGTTCACGAA GGAAGCGCTG
GAGAGCGGGT TCTCCGATCC GATCAAACTC GATGAACAGA CAGCGGCAGC CTTCCAGGCA
ATCCACGACC TGGTCTACGT CCATAAGGTC GCTCCCGACC AGGCAGCTTC CCAGGCGCTT
GATCAACTTG GCGGCGCGTT CCTCTCCGGT CGCGTGGCCA TGTTCATGAC CGGCGGTTGG
GGACACTGGA ACTACAAGGA AATTATCGAT GATCCGAATG GGTTCTGCTG GGGCGCAGCG
CCAATTCCCT GGGGCTCCCC TGATGCGAAC ATCCGCGCAA CGATCTTCAC CGACCCATGG
GTCATCACTG CTGGAATGGA CGCTGAAAAT ACCGATCTTG CCTGGAACTT CGTGAAGTTC
CTGGCGTCGG CGGAACAGCA GCGCGCCTAC ACGCTGGCAA CCGGCACCCC GCCTGTGCGT
CAGAGCCTGC TCAACGACTA CTACAAGCAG TATGAGAAGT GTGTCCCGGC GGAAAAAACC
AAAGAGTCCT TCCAGGGCGC CTTCTCTCAC GGGCGCGAGT CATCGAACCA CCTGCTGGTC
AAGTTTGATG AACTCAGCCA GACGTGGGAT AACCTGCTGA GTCCGTTCTG GAATGATCCA
AATGCAAAGG CGACCGACCT CATGCCGATC CTTGAAGCGG ATGTGAATGC CGCGTTGGAG
CGCATCCGCA AAGAAGCAGG CAGGTAA
 
Protein sequence
MRLRQYPWIL LLGALMLVLA ACGGQQTASP TAAPGQAPTT APVTDLPTPT PVLEFAQQPQ 
PGQKVLVWMV RINATENRWE RDVVLPAYQQ VAPDVFVKVL NINQDDIAVK REAMIAAKEP
LHVWSSNWGG DGFASDRFRG LLADLTPLIE RDKWDTSDFI PEVFAIYNVE GKQYGIPFLT
TGSYVYYNMK LFDEAGVPYP PSDWNDKSWT WDAFLETAKK LTKNPDDPST AVYGGVNGLW
PPFDSIPMIW GKDPFTKEAL ESGFSDPIKL DEQTAAAFQA IHDLVYVHKV APDQAASQAL
DQLGGAFLSG RVAMFMTGGW GHWNYKEIID DPNGFCWGAA PIPWGSPDAN IRATIFTDPW
VITAGMDAEN TDLAWNFVKF LASAEQQRAY TLATGTPPVR QSLLNDYYKQ YEKCVPAEKT
KESFQGAFSH GRESSNHLLV KFDELSQTWD NLLSPFWNDP NAKATDLMPI LEADVNAALE
RIRKEAGR