Gene Rcas_4429 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_4429 
Symbol 
ID5541942 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp5697498 
End bp5698790 
Gene Length1293 bp 
Protein Length430 aa 
Translation table11 
GC content62% 
IMG OID640896527 
Productextracellular solute-binding protein 
Protein accessionYP_001434463 
Protein GI156744334 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2182] Maltose-binding periplasmic proteins/domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGATGCC GTCTGGTTGC CATTCTACTG GTTGCTGCGC TGCTCGCCGC ATGTGAGTCG 
GGCAGCACGC CCGAAACCTC TTCCGCTCCG ACGACTGTTG CACAAGATTC CAGCGTTGGA
CCACTCCTCC TCTGGCATGG ATGGTCGGGC GGTGATCGGC AGGCGCTGGG GCGCCTGGTG
GATCGCTATA ACCGTCAGCA GCGCGACGGG CGTATTGTGC TGCAATCAGT GCCGCTGGCA
GGCTTCGCCG CCGAACTGCG CGCAGCCGTG GCAGCAGGCA GTGGTCCTCA TCTGATTCTG
ATTCCAAATA CCTGGATCGG TGGGCTGGCG GAGGCGGGGG TGTTGCTGCC GCTCAACGAT
CTGGTGCCGG CGCAGGAGAC CGGTACGCTT CTGCCGGTGA CGCTGGCGGG AGCGCAGGCG
CGCGATGCCG CCGGAACACT GCGGTTGTAC GGTGCACCGG TACGCTTCGA TACGCTGGCG
CTCTACTACA ATGCTGCCAA TCTCACCGAG CCCCCCGCCG ATACCGCAAC CATGCTCGCT
GTTGGACGCG GCTTGAGCGA CCCGGAAGCC CAACCACCGA TCTGGGGACT GGCGCTCAAC
CTGTCGTATG ACAATATGAT CGGGTATCTC TACGCCTTTG ACGGGCGGAT ATTCGATGAC
AACGGGCAGG TTGCGCTCGG TACAGCCGGT CGTGCTGGCG CAGAACAATG GCTCGCCTGG
TTGATCGCGC TGCAAAATGA TCCGCGCATT CTGGCGCGGA GCGAGAGTAG CATCCTGGTC
GATCGTGAAT TGAAAGATGG GCGCGCCTTT ATGACGTTTG ATTGGGCGCA TCAGATCGGT
GTCTATCGTG GTCTGTGGGG CAATCAGATC GGCATTGCGC CGTTGCCACG TCTGAGTGAA
ACGGGACGGG CGCCACGTCC ATATGTGCGC GCAGATGTCC TGGCGATCAA TAATCTTGCC
GGGGTACGTG AGCGCGAGGC GGCTGCACGG TTTATCCGTT TCATGATCAG CGAAGAAGCG
CAGGCTGTTC TGCTGCAAAG TGATATGCAA CCGGCATCGC GCACACTGGC GCTGACCGGC
GATTCGCCAC AGGAGATCGC CGCACAGGTG TTTCGCGTCC AGGCGGAACA GGGGCTTCCC
ATGCCCAACT CGAGTGTGCG CGCCTTTGTG GAGCAGGAAA TCAAACGCAT GCAACGCCAG
GCGTCGCTCG GTCTCACCAC ACCATCCGAT GCAGTTACTG AGGCCGACCG CCGGCTGCGC
GAACGATTGG AACCTTCTGC GCCAATGCCT TAA
 
Protein sequence
MRCRLVAILL VAALLAACES GSTPETSSAP TTVAQDSSVG PLLLWHGWSG GDRQALGRLV 
DRYNRQQRDG RIVLQSVPLA GFAAELRAAV AAGSGPHLIL IPNTWIGGLA EAGVLLPLND
LVPAQETGTL LPVTLAGAQA RDAAGTLRLY GAPVRFDTLA LYYNAANLTE PPADTATMLA
VGRGLSDPEA QPPIWGLALN LSYDNMIGYL YAFDGRIFDD NGQVALGTAG RAGAEQWLAW
LIALQNDPRI LARSESSILV DRELKDGRAF MTFDWAHQIG VYRGLWGNQI GIAPLPRLSE
TGRAPRPYVR ADVLAINNLA GVREREAAAR FIRFMISEEA QAVLLQSDMQ PASRTLALTG
DSPQEIAAQV FRVQAEQGLP MPNSSVRAFV EQEIKRMQRQ ASLGLTTPSD AVTEADRRLR
ERLEPSAPMP