Gene Rcas_0901 details


Gene Information

Locus tag: Rcas_0901
Symbol:
ID: 5538367
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 1179506
End bp: 1181878
Gene Length: 2373 bp
Protein Length: 790 aa
Translation table: 11
GC content: 61%
IMG OID: 640893051
Product: glycoside hydrolase family 3 protein
Protein accession: YP_001431034
Protein GI: 156740905
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1472] Beta-glucosidase-related glycosidases
TIGRFAM ID:
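
The coordinates and lengths in the table above can be cross-checked with a little arithmetic. The sketch below is illustrative only and rests on two assumptions that the record does not state: that Start bp and End bp are 1-based and inclusive, and that the reported gene length includes the terminal stop codon. The function names are hypothetical.

```python
# Values copied from the Gene Information table.
START_BP = 1179506
END_BP = 1181878
GENE_LENGTH_BP = 2373
PROTEIN_LENGTH_AA = 790


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming the length includes one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print("span matches gene length:",
          span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print("protein length matches:",
          expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under those assumptions both checks agree with the table: the span covers 2373 bp, which is 791 codons, or 790 residues plus one stop codon.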


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.72185
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCTGACA CAACTACCAC GTATCCGTAT CAACACGCCG GTCTCCCCAT CGAACAGCGT 
GTCGAAGATC TGCTTGGTCG GATGACCGTC GAGGAGAAAG TCGCCCAACT GAGTAGCCGC
TGGATTTATG AGATTGCCGA TGATCGCGGG TTGAACCGGC AATGGGCGCA GGAGCGCATG
GCGCACGGCC TAGGACAGGT AACGCGACTC GCGGGCGGCA GCAGCCTGGG ACCGGTCGAG
ACAGCCAGAC TGGCGAATCA GATCCAGAAG TTTCTGGTCG AGGAGACACG GCTTGGCATC
CCGGCGCTGA TCCATGATGA GTGCTGTAGT GGTTTCCTTG CCAACGGCGC TACTAATTTC
CCGCAGATTA TCGGCATTGC CAGCGCCTGG GAGCCGGAAC TGGTCGAAGC AATGACTCGC
GTTATTCGCC AGCAGATGCG CGCCGTCGGC GTGCATCATG GGCTGGCGCC GGTGCTCGAC
ATCGCCCGCG ATCCACGCTG GGGACGCACC GAGGAGACAT TTGGCGAGGA CCCGTACCTG
ACATCAGTCA TGGGCGCCGC CTACATTCGC GGGCTACAAG GCGCCGACTG GTCCGAGGGC
GTGATGGCGA CGGGCAAACA TTTCGTAGGG TACAGCGCGT CGGAAGGCGG ACTCAACTGG
GCGCCGGCGC ATATCACGAC GCGCGAATTG CGCGAAGTCT ATCTGGCGCC GTTCGAGACG
GCAGTACGCG CCGCGCGGTT GGCGTCGATC ATGCCCGCCT ACCACGAAAT CGACGGCGAG
CCGTGCAGCG GCGCACACTG GTTGCTGACC GGCATTCTGC GCGATGAATG GGGATTTGAA
GGGTTGGTCG TTTCCGACTA TATGGCCATC GATCAGTTGC GCAACTATCA CAAACTGGCG
CGTGACAAGG CGCATGCCGC GCGACTGGCG CTCGAAGCGG GAATGGATAT TGAATTGCCA
AACGTCGAGG CTTACGGTCA ACCGCTGCTC GACGCCCTTG CTGCGGGTGA GATTCCGATG
GAGTGGGTGG ATCGTTCGGT GCGCCGTATT CTGACCTTGA AGTTCGCTTT TGGGCTGTTC
GAGAACCCCT ACGTGGATCC CGACGCCGTT CCTGCGGTGT TCGACACGCC AGCACAGCGT
GAACTGGCGC GCGAGATCGC GCGCAAGTCG ATTGTGCTGC TCAAGAACGA GGGCAATCGG
TTGCCGCTGC CCAAAACCCT GAGCGCTATC GCCGTCATCG GTCCCAACGC CGACAGCAAG
CGCAACCTGC TGGGCGACTA TTCTTATCCA GCGCATATCG AGACGCTGAT CACGCTGAGT
CAACTGGGTT TCTCTGAGCA TCCGCTGCCG GATTCGATCC GGCTCATCGA GAACGATTCT
TCGATGCTCT CCATCGTTGA AGCGATTCGT CGCACCGTGT CACCGACAAC GCAGGTCCTC
TACGCCAGGG GATGTGATGT CAATTCGCCA TCGACCGACG GTTTTGCCGA AGCAATCGAG
GCGGCGCGCA AAGCGGAGGT TGCCATCGTG GTCGTCGGCG ATAAGGCCGG TCTGACGCCA
GAGTGCACGT CGGGCGAATT CCGCGACAGT GCCCACCTGA CCCTCCCCGG TGTGCAGCAA
CAGTTGGTTG CAGCGATCCT GGCAACGGGA ACGCCGGTGG TGCTGGTGCT GGTGACCGGA
CGCCCCTATG CGATTCCGCA TCTTGTCGAC GCGACCCCTG CGGTCGTCGA AGCCTGGCTG
CCAGGAGCGG AGGGAGCGCC GGCATTGGCA GAGGCGCTCT TTGGCGACGT CAATCCCGGC
GGCAAGTTAC CCATCACCTT TCCGCGTCAC GTGGGACAGG TTCCGCTGTT CTACGCTCAT
CGCCCCTCTG GAGCGCGATC GTTCTTCTAT GGACCGTACA TGGATGAGAG CAATCAACCG
CTCTTTCCGT TCGGCTTTGG GCTGAGCTAC ACGCAGTTCG CATTCGAAAA CCTGACGGTC
ACGCCTGACG TGACGACCGA TGGCGAAGTG CAGGCATCGG TGGACGTGAT CAACACAGGA
GAGCGCAGCG GCGATGAGGT GGTGCAACTC TACACACGCA CCGAAGGCGC CAGCGTCACG
CGACCGGTCA AAGAACTGCG CGGCTTCAAG CGTGTGCATC TCCAACCGGG CGAACGAAAG
CGCGTGATCT TTACCTTGCC TGTCGAACTG CTCGCGTATT ATGATACGAC CATGCACCTC
GTCGTTGAGC CGGTTGAGGT GCACATTATG GTTGGCAACT CTTCGGCGCA CCTGCCATTA
TGTACATCGG TTCGCTTGAC CGGCACGAAG CGGCTGATAA CGCAGCGCGC AGCATATTTC
TGCCACGCGG CGGTGATGGA CAGCGGAGGT TGA
 
Protein sequence
MPDTTTTYPY QHAGLPIEQR VEDLLGRMTV EEKVAQLSSR WIYEIADDRG LNRQWAQERM 
AHGLGQVTRL AGGSSLGPVE TARLANQIQK FLVEETRLGI PALIHDECCS GFLANGATNF
PQIIGIASAW EPELVEAMTR VIRQQMRAVG VHHGLAPVLD IARDPRWGRT EETFGEDPYL
TSVMGAAYIR GLQGADWSEG VMATGKHFVG YSASEGGLNW APAHITTREL REVYLAPFET
AVRAARLASI MPAYHEIDGE PCSGAHWLLT GILRDEWGFE GLVVSDYMAI DQLRNYHKLA
RDKAHAARLA LEAGMDIELP NVEAYGQPLL DALAAGEIPM EWVDRSVRRI LTLKFAFGLF
ENPYVDPDAV PAVFDTPAQR ELAREIARKS IVLLKNEGNR LPLPKTLSAI AVIGPNADSK
RNLLGDYSYP AHIETLITLS QLGFSEHPLP DSIRLIENDS SMLSIVEAIR RTVSPTTQVL
YARGCDVNSP STDGFAEAIE AARKAEVAIV VVGDKAGLTP ECTSGEFRDS AHLTLPGVQQ
QLVAAILATG TPVVLVLVTG RPYAIPHLVD ATPAVVEAWL PGAEGAPALA EALFGDVNPG
GKLPITFPRH VGQVPLFYAH RPSGARSFFY GPYMDESNQP LFPFGFGLSY TQFAFENLTV
TPDVTTDGEV QASVDVINTG ERSGDEVVQL YTRTEGASVT RPVKELRGFK RVHLQPGERK
RVIFTLPVEL LAYYDTTMHL VVEPVEVHIM VGNSSAHLPL CTSVRLTGTK RLITQRAAYF
CHAAVMDSGG
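
The relationship between the gene sequence and the protein sequence can be spot-checked by translating a fragment. The sketch below is a minimal illustration, not the tool used to produce this record. It uses the standard codon-to-amino-acid mapping, which translation table 11 shares for internal codons (table 11 differs in the start codons it permits). The helper name `translate` is hypothetical, and only the first and last rows of the gene sequence are included to keep the example short.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (standard genetic code).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used above) is ignored.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First and last rows of the gene sequence, copied from above.
FIRST_ROW = "ATGCCTGACA CAACTACCAC GTATCCGTAT CAACACGCCG GTCTCCCCAT CGAACAGCGT"
LAST_ROW = "TGCCACGCGG CGGTGATGGA CAGCGGAGGT TGA"

if __name__ == "__main__":
    print(translate(FIRST_ROW))
    print(translate(LAST_ROW))
```

Each complete 60-base row holds exactly 20 codons, so every row of the gene sequence starts in frame and can be translated on its own in this way.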