Gene Rcas_0521 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_0521 
Symbol 
ID5537984 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp679575 
End bp680936 
Gene Length1362 bp 
Protein Length453 aa 
Translation table11 
GC content62% 
IMG OID640892683 
ProductBeta-glucosidase 
Protein accessionYP_001430669 
Protein GI156740540 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2723] Beta-glucosidase/6-phospho-beta-glucosidase/beta-galactosidase 
TIGRFAM ID[TIGR03356] beta-galactosidase 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones40 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAATAC GACACTTCCC CGACGATTTT CTCTGGGGCG CTGCGACGGC TGCCTTTCAA 
ATCGAAGGCG CCACCCGCGA AGATGGACGC GGCGAGTCGA TCTGGGACCG GTTCTGTGCA
ACGCCAGGCA AGGTGCTCAA CGGCGACACC GGCGATCCCG CCTGCGATCA CTACCACCGT
TGGCGCGACG ACATCGCGCT CATGAAATCA CTCGGCTTTC CGGCATATCG GTTTTCAATC
GCCTGGTCCC GTATTATGCC CAAGGGGAGA GGCGCAGTCA ATCCTGCCGG TCTCGATTTT
TACGACCGCC TGGTTGATGG TTTGCTGGCA GCAGGCATTC GCCCGTTTGT GACATTGTAC
CACTGGGATC TGCCACAGGC GCTCGAAGAT GCTGGCGGTT GGCCCGCGCG CGATACCGCC
GCTGCGTTCG CCGACTATGC CGATGTTGTG GCGCGACGCC TGGGGGATCG TGTGAAACAC
TGGATCACGC TCAACGAACC GTGGTGTTCC GCATTTCTCG GCTATTGGAC CGGTGATCAC
GCGCCGGGGC GGAAGGAAGG ACCGGCGCTT GCTGCGGCGC ACCACCTGCT CCTCGGTCAT
GGGCTGGCGC TCGCCGCTCT TCGCGCTGCA CACTCCGACG TTCGGGCGGG CATTACTCTC
AACTTTTCGC CTGCTGACCC GGCGAGTGAT AGCGATGCGG ATCGCGCGGC GGCGTGGCGG
TACGATGGCT TTTTCAACCG CTGGTACCTC GATCCGCTCT ATCGCAGCGC CTATCCCGCC
GACATGCTGG CGCTCTATGC GCAGATGGGG CAGGCGCCGC CGGTGCAAGA CGACGATATG
CGCATCATCG CTGCGCCGCT CGATTTTCTG GGGGTGAACT ACTACTCGCG CGCCGTCATT
CGCGACGATC CGCAGGCTGG CGGTCTCAGG TACGCACACA AGCGACCGGA AGGCGAGTAC
ACCCAGATGG ATTGGGAAGT TCATCCCGCT TCGCTGCGCC GACTGCTGGA GCGATTGCAC
CGTGATTACG CGCCGACGAC GCTGTACATA ACTGAAAACG GCGCCGCCTA TCCAGACGAA
GTCTCATCCG ACGGCGGCGT CCACGACCCG GATCGCGTGC GCTACATCGC GCGTCATCTG
GCGGCATGCC ACGATGCCAT CGCTGCCGGA GTTCCGCTGC GCGGATACTT CGTCTGGTCG
TTAATGGACA ACTTCGAGTG GGCATTCGGT TATAGCCGCC GATTCGGTAT TGTGTACGTG
GACTACGCCA CTCAGCGGCG CATTCCAAAG GACTCGGCGC TGTTCCTGCG CCAGGTGATC
GCCGCAAATG CGTTGACAGA GACGCAGATG TTTACGAGGT GA
 
Protein sequence
MAIRHFPDDF LWGAATAAFQ IEGATREDGR GESIWDRFCA TPGKVLNGDT GDPACDHYHR 
WRDDIALMKS LGFPAYRFSI AWSRIMPKGR GAVNPAGLDF YDRLVDGLLA AGIRPFVTLY
HWDLPQALED AGGWPARDTA AAFADYADVV ARRLGDRVKH WITLNEPWCS AFLGYWTGDH
APGRKEGPAL AAAHHLLLGH GLALAALRAA HSDVRAGITL NFSPADPASD SDADRAAAWR
YDGFFNRWYL DPLYRSAYPA DMLALYAQMG QAPPVQDDDM RIIAAPLDFL GVNYYSRAVI
RDDPQAGGLR YAHKRPEGEY TQMDWEVHPA SLRRLLERLH RDYAPTTLYI TENGAAYPDE
VSSDGGVHDP DRVRYIARHL AACHDAIAAG VPLRGYFVWS LMDNFEWAFG YSRRFGIVYV
DYATQRRIPK DSALFLRQVI AANALTETQM FTR