Gene Rcas_2546 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_2546 
Symbol 
ID5540028 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp3286148 
End bp3287614 
Gene Length1467 bp 
Protein Length488 aa 
Translation table11 
GC content61% 
IMG OID640894676 
Productextracellular solute-binding protein 
Protein accessionYP_001432643 
Protein GI156742514 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.39868 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.139719 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCACAC ACCACGAGGA AGAACTCGCG CAGGAGTTGA AACGGCGTGG CTTGAGTCGC 
CGCGAGATTT TGCAGATCGG CGCACGCCTC GGGTTGAGTA GCGCTGCTGT GGCGGCCGTG
CTGGCGGCAT GCGGGCAGGC GCCGCAGCAA TCGGGTACGG GCGCTGCGCC GACCACGCCT
CCCGCAACCG AGCCGACACC GACCTCACTC TACGATCTGG AAGAGGGCAA TAGCGGCTGG
CCCACCAATG CCATTGCCGA TCCGTCCGAG CGGGTGGAGA TTAGTGTCGC GCACGCCTGG
GACGCTGTCT TCTTCGAGCG TCAGAAGCAG TTTGATACGC TCTTTATGCA GCGCCACCCG
AATATCGTCG TCAAAGCCGA GAATACGCCC TTCGGCGAGT ACCGCCAAAA ATACGTCGCG
CAGGCGGCCG GCAACGCCTT GCCCGATATC ATGTACTGCC AGTTTTCATG GGCGCAGGAG
TTCATCAAGA ACGGCCTGTT CCGTCCGCTC GACGACTACA TCGCTAAAGA GAAGGACTTC
AACCTGCAAG ACTTCACGCC GCAGTCGCTG GTGTCGTACC AGCGCGACGG CAAACTGTGG
GGCATTCCAT ATGATGAAGG TCCCGCCAAC CTGTACTACA ACAAAGATAT TTTCGATGCC
GCCGGGATTC CCTACCCCGA CGAAACCTGG GACCTCGAAA AGTTGAAGGA AGTCGCGCTG
AAACTGACGC AGGGCGAAGG ACCGAACAAG ATTTTCGGGC TGGGCGAACT CCCATCCCTG
GGCGACTCGC TGGTGGCGCC ACCCTACCTG ATGCCCTTTG GCGCTCAGTA CCTGCGCGAA
CCCAAGGAGG ACGAGTGTCT GATCAACCAG CCCGAAGCGG TTGCCGCCCT CGAATGGTGG
CAGGAGTTGC GCGATAAGGG TGCGGTGCCC AGCCCCGCCG ACCTGCAAAA CGTCGCCTGG
CCCGCATTCC AGTTCGGCAA GATTGCGATG ACCATGCAGG GTTCGTGGGC GACTCCGCCG
ATCCGCGCCG GCGCCAAGTT CAACTGGGAT ATTGCGATGT GGCCTAAAGG TCCGAAGGCG
CATGTCACCT TCTCCGCCGG AAGCGCCTAC ATGATCACAC GCGACAGCAA AAACCCTGAC
GCGGCGTGGA TCTATCTGAA CGAATACCTC TCGACCGCCG GGCAATCGTA TATGTGGGGA
ATTACCGGGC GCGGCAGCCC GGCGCGCCTC TCGGCGTGGC CCTCGTACCT CAACTCGAAG
TTTGCGCCTC CCGGCGCGAA ATACGTCGAG CAGGCAATGC GCACCATCGC CAGCCACGAC
ATTATCGATC AACCGACCGG TCCGCAGGTC ACGCAGGCGG CAGGACCGAT CTGGGATCTG
GTGGTGGCCG GGCAGTTGAG CGTGAAGGAA GCGTGTGATC AGGTGTTTGC TGCGGTGGAC
CCGATTATCG CCGTCAATCG CGCGTGA
 
Protein sequence
MTTHHEEELA QELKRRGLSR REILQIGARL GLSSAAVAAV LAACGQAPQQ SGTGAAPTTP 
PATEPTPTSL YDLEEGNSGW PTNAIADPSE RVEISVAHAW DAVFFERQKQ FDTLFMQRHP
NIVVKAENTP FGEYRQKYVA QAAGNALPDI MYCQFSWAQE FIKNGLFRPL DDYIAKEKDF
NLQDFTPQSL VSYQRDGKLW GIPYDEGPAN LYYNKDIFDA AGIPYPDETW DLEKLKEVAL
KLTQGEGPNK IFGLGELPSL GDSLVAPPYL MPFGAQYLRE PKEDECLINQ PEAVAALEWW
QELRDKGAVP SPADLQNVAW PAFQFGKIAM TMQGSWATPP IRAGAKFNWD IAMWPKGPKA
HVTFSAGSAY MITRDSKNPD AAWIYLNEYL STAGQSYMWG ITGRGSPARL SAWPSYLNSK
FAPPGAKYVE QAMRTIASHD IIDQPTGPQV TQAAGPIWDL VVAGQLSVKE ACDQVFAAVD
PIIAVNRA