Gene Rcas_0919 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_0919 
Symbol 
ID5538385 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp1203566 
End bp1206310 
Gene Length2745 bp 
Protein Length914 aa 
Translation table11 
GC content62% 
IMG OID640893069 
Productglycoside hydrolase family 3 protein 
Protein accessionYP_001431052 
Protein GI156740923 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1472] Beta-glucosidase-related glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTGACA CGGGAGTCGC TCTCACCAAC GACGTGGAGG CGCGCATCGA AGCGCTGCTG 
CGACAGATGA CGCTGGCGGA GAAGGTCGCG CTGATGGCAG GATCGAGCAT GTGGACGACC
ACTCCCATCG AGCGGTTGGG GATTCCTGCG ATCAAAGTCA CGGACGGACC GAACGGCGCG
CGGGGTGCAG GTGGATTCGT CGGTGGCGCC GTTACGGCAG CGTGCTTCCC TGTAGGAATT
GCGCTGGCCG CGACATGGAA CAGCAGGCTG GTGGAAGAGG TTGGCGAGGC GCTCGCCGAA
GAAGCGCAAT CCAAAGGCGC TCGCCTCCTG CTGGCGCCGA CCGTTAACAT CCATCGTTCG
CCGCTCAATG GGCGCAACTT CGAGTGCTAT TCCGAAGACC CGTATCTCTC GGCGCGCATG
GCGGTCGCCT ATATCACCGG GTTGCAGCGG CGCGGCGTTG GCGCGACGAT CAAACACTAC
GTCTGCAACG ACTCGGAGTT CGAGCGGAAC ACGATCAGTT CTGAGGTCGA TGAACGCACA
TTGCGCGAGA TCTATCTGCC TCCCTTTCGC GCTGCCGTGC AGGAGGCGAA AACCTGGGCG
GTCATGGCGG CGTACAATCG TGTCAATGGG GTGTATGCCA GCGAGCATCC GGTATTGCTC
AACGATATCC TGAAGCGCGA ATGGGGATTC GATGGCATTG TGATGTCCGA CTGGTTCGGC
ACGAAGAGCG TCGTCGAGGC TGCCGCCAAC GGGCTGGACC TTGAAATGCC GGGACCAACG
CGCTGGCGCG GTGAGCGATT AGTCGCCGCC GTCGAGAATG GTCAGGTGCG TATGGAAGCC
ATCGATGAGT CGGCTTGTCG AATATTGCGC ACGATTGCGC GCGCGGGGGC GTTCGAGACA
CCGGAGATTC CCCCTGAGCA GGCGATTGAT CGCCCTGAGC ACCGGGCGCT GATCCGCCGT
GCTGCCGCCG AGAGCATGGT GCTGCTCAAG AACGATGGCG GCATCCTGCC GCTCAATCTG
GCGAACCTGT CGTCGATTGC GATCATCGGA CCCAACGCGA AGACGGCACA GATCATGGGT
GGCGGGAGCG CACAGGTCAA CGCGCACTAC GCCATTTCGC CCTACGACGG CATTGCGGCG
CGAGTCGGCG GGCAGGTGAT CCTGGAGTAC GAGATCGGTT GCACGAACCA TCGACACCTT
CCGCGCTTCG ATAGCCGATT GGTGACGCCG GAGAGCGGCG AGGGGCGCGG CTTTACCGTC
GCTTACTACA ACACCCACGA CCTGTCCGGT GAGCCGGTTC ATCAGGCGGC GACCGAGAGC
AGCGAGCAGG TCTGGCTAGG GGAGGTGGCG CCGGGCGTCG ATCCACGCCA GTTTTCGGCG
CGTTTCACCG CGCGGTTTAC GCCCGCCGAA CGTGGAACGC ATACCTTTAG CCTGATCAGC
GCCGGGCTGA GTCGCCTCTT TGTCGACGAC GTACTGATCA TCGACAACTG GACGACCCAG
ACACGCGGGG ATGCGTTCTT TGGCGCAGGA AGCGCCGAAG CGACGGCGCC GATGACGCTG
GAAGCCGGTC GAACCTACGC GCTTCGCCTG GAATATAGTA ATCAGGGCGC GACCATGCTT
GCCGCTGTGC GGCTCGGCTA TCTGCCGCCG GTTGCAGAAG ACGCCATCGA GCGCGCCGCA
GCCCTGGCGG CGCAATCCGA CGTTGCGCTG GTGTTTGTCG GGCTGAATGC CGATTGGGAG
AGCGAAGGGT ATGATCGCCC GCATATGGAC CTGGTCGGCA GGCAGGACGA ACTGGTCGAG
CGCGTGGCAG CCGCCAATCC GCGCACGATT GTCGTGCTGC AAACCGGTTC GCCGGTGACG
ATGCCGTGGC TGGATCGGGT GGCGGCGGTC CTTCAGGCGT GGTATCCCGG TCAGGAATGC
GGTAACGCGA TTGCCGACGT GTTGTTTGGC GATGTTAACC CCTCGGGCAG ACTGCCGCAG
ACTTTTCCGG TTCGATTGGA AGACAATCCG GCATACATCA ACTATCCCGG TGAGAACGGG
CGGGTGCGCT ACGGTGAAGG TATCTTCGTC GGCTACCGCT ACTACGAGAA GAAAAAGGTT
GCGCCGCTGT TTCCCTTTGG CTTCGGTCTT TCGTATACCA CGTTCCGCTA CGATAACCTG
CGCCTGAGCG CCGATGTCAT TGCTCCCGAT GATCGGCTCA CGGCGCAGAT CGACATCACC
AACACCGGGA TGGTCGCCGG TCAGGAAGTG GTGCAACTGT ACGTGCGCGA CAGCGCCGCG
CGCGTCGCCC GACCGGCAAA GGAGTTGAAA GGGTTTGTCA AAGTCGCGCT GCAACCGGGC
GAGACACAAA CGGTGACCTT CTCGCTTGAT CGGGAGGCGC TGGCGTACTG GGACGACGTC
CAGCATGCCT GGGTCGCCGA GGCAGGTGAG TTCGAGGTTC TTGTGGGAAG TTCATCGCAG
GACATCCGGG CGCGCGCGGT GTTTCATCTG AATGATACTG TCGCCTTCGG CGGACCGACA
AAGTCGCCGG TGCAACTGAG TGTCGACTCG CCGGTCAAGG CGTTGATCGA ACACGACGGT
GCGCGTGCAG TGCTGGAACG CCACATGCCC GGTTTTGTCG AACAGGCTGG CGTCGGTGTC
ATGATGGGGC TGACGCTGGC GCAGATGGCA GCATTCGCAG CGGATCGGAT CACGCCGGAA
CTGTTGGGCG CGATTGCCGC AGACCTGGCG CGGATTCAGG CATGA
 
Protein sequence
MTDTGVALTN DVEARIEALL RQMTLAEKVA LMAGSSMWTT TPIERLGIPA IKVTDGPNGA 
RGAGGFVGGA VTAACFPVGI ALAATWNSRL VEEVGEALAE EAQSKGARLL LAPTVNIHRS
PLNGRNFECY SEDPYLSARM AVAYITGLQR RGVGATIKHY VCNDSEFERN TISSEVDERT
LREIYLPPFR AAVQEAKTWA VMAAYNRVNG VYASEHPVLL NDILKREWGF DGIVMSDWFG
TKSVVEAAAN GLDLEMPGPT RWRGERLVAA VENGQVRMEA IDESACRILR TIARAGAFET
PEIPPEQAID RPEHRALIRR AAAESMVLLK NDGGILPLNL ANLSSIAIIG PNAKTAQIMG
GGSAQVNAHY AISPYDGIAA RVGGQVILEY EIGCTNHRHL PRFDSRLVTP ESGEGRGFTV
AYYNTHDLSG EPVHQAATES SEQVWLGEVA PGVDPRQFSA RFTARFTPAE RGTHTFSLIS
AGLSRLFVDD VLIIDNWTTQ TRGDAFFGAG SAEATAPMTL EAGRTYALRL EYSNQGATML
AAVRLGYLPP VAEDAIERAA ALAAQSDVAL VFVGLNADWE SEGYDRPHMD LVGRQDELVE
RVAAANPRTI VVLQTGSPVT MPWLDRVAAV LQAWYPGQEC GNAIADVLFG DVNPSGRLPQ
TFPVRLEDNP AYINYPGENG RVRYGEGIFV GYRYYEKKKV APLFPFGFGL SYTTFRYDNL
RLSADVIAPD DRLTAQIDIT NTGMVAGQEV VQLYVRDSAA RVARPAKELK GFVKVALQPG
ETQTVTFSLD REALAYWDDV QHAWVAEAGE FEVLVGSSSQ DIRARAVFHL NDTVAFGGPT
KSPVQLSVDS PVKALIEHDG ARAVLERHMP GFVEQAGVGV MMGLTLAQMA AFAADRITPE
LLGAIAADLA RIQA