Gene Rcas_0867 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_0867 
Symbol 
ID5538333 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp1135682 
End bp1136674 
Gene Length993 bp 
Protein Length330 aa 
Translation table11 
GC content61% 
IMG OID640893018 
Productperiplasmic binding protein/LacI transcriptional regulator 
Protein accessionYP_001431001 
Protein GI156740872 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1879] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000788488 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCCTACCC TAACCTCACG ACACATGCCA CTTTTTCTGC TCATGCTCAT TGTTGCGCTT 
ACCGCCTGCA CTGCGCCGAC ATCCGCCCCG ACCGGCAACA ACGCGCCGCG TCAGTGGCGT
ATTGGAATGT CTCAGGCAAA CAACGCCGAG CCGTGGCGCC AGGCGATGAA TGACCAGATT
GCCGCCGCCG CTGCCCGTTA TCCGCACATC CGCATTGAGT TCACCGATGC ACGCCAGAAC
AATGCGCAAC AGGTCGCCGA TGTCGAGACC TTTCTCCAGC AAGGGATCGA CCTGCTGATC
ATTTCACCCA ACGAAGCCAC CCCGCTTACG AATATCGTCG CACAAGCGTT TCAGCGCGGC
ATTCCGGTGA TCGTGCTGGA TCGCAAGGTG AGGGGCGACC AGTACACTAT GTGGATCGGG
GCCGACAACC GCCTGATCGG TCGGAAAGCG GGTGAGTATA CGGCGCGCTG GTGCCGTGAA
CAGCAACGAT CGCCGTGCAA CGTGATCGAA CTGCGCGGTC TGGAAGGCTC CACACCGGCG
CAGGAACGCG GCGATGGCTT CCGCGAAGGG ATTGCTGGCA ACCCGGATGT GCGCATCATT
GCCAGCCAGA ATGCCGATTG GCTGGCCGAG CGCGCCGCAC CGCTATCGCG CGCGATGTTG
GAAGCCAACC CTGTGGTTGA TGTCGTCTAT GCCCACAACG ATCCTATGGC TATCGCGTCC
TTCACCATGG CGCGGGATCT GGGGCGTGAC CCGGACTCGA TCCTGTTCAT CGGCATCGAC
GCTCTTCCGA CGCCCGATGG CGGCATCCAG GCGGTGCGGC AGGGACAACT CGATGTCACC
TATGTGTACC CGACCGGTGG GGCGGAAGCT GTCGAGTGGG CGATTCGGAT ACTGGAGCGA
GGCGAAACGC CACCGCGCGA GGTTATTCTT GACACGGAAG AAGTGACCAG GGCGAATGCC
GACGCTCTAT GGCAAAAATA TGGAGGGCGG TAA
 
Protein sequence
MPTLTSRHMP LFLLMLIVAL TACTAPTSAP TGNNAPRQWR IGMSQANNAE PWRQAMNDQI 
AAAAARYPHI RIEFTDARQN NAQQVADVET FLQQGIDLLI ISPNEATPLT NIVAQAFQRG
IPVIVLDRKV RGDQYTMWIG ADNRLIGRKA GEYTARWCRE QQRSPCNVIE LRGLEGSTPA
QERGDGFREG IAGNPDVRII ASQNADWLAE RAAPLSRAML EANPVVDVVY AHNDPMAIAS
FTMARDLGRD PDSILFIGID ALPTPDGGIQ AVRQGQLDVT YVYPTGGAEA VEWAIRILER
GETPPREVIL DTEEVTRANA DALWQKYGGR