Gene Rcas_3244 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_3244 
Symbol 
ID5540742 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp4216223 
End bp4218973 
Gene Length2751 bp 
Protein Length916 aa 
Translation table11 
GC content63% 
IMG OID640895365 
Productglycoside hydrolase family protein 
Protein accessionYP_001433316 
Protein GI156743187 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3250] Beta-galactosidase/beta-glucuronidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0368186 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTTTCCT ATCACATGCC GCTCCCTGGT CCCTGGGAGT TTCGTTTTGA CGATCAATCC 
GACTGGCGCC CCATTGCGGT TCCCGGATGC TGGGAAGACG CTGGTTTTGT AAAAGACCGC
TCTGGTCCGG CATGGTATCG CACCTCATTC GTAATTCCGC ACGAACTGGA AGGACGGCGC
CTCTTTCTGC ACTTTGGCGC CGTCAGTTAT CACTGTGAAG CATTCATCGT GCGCGATTCG
GGTGATATGT GCAGCATCGG CACGCACACC GGCATGTGGG ATGCGTTCGA CCTCGAAATT
GGCGATGCCG CAGCGCCGGG TGAGCATGTC ACGTTGCTGG TGCGGGTCGA GAAACCGGCA
AGTCTGAGCG ACGGACCGGA GTCGGCATCG CTGCCAGGGC GCTTCCCGCT GCGCGAAACG
CTCGCCGGAT TTCTCCCGTA TGTCTGGGGG CATAGCCACG GCGGAATCTG GCAGGAAGTC
GCCCTGATCG CCTGCGGTGC GACGCGCTTT CTCGATGCCT GGGTGCATGG CGCGCCTGAT
GGTCACATCT TCGTGGAAGC GGAACTCGAT GGTCCGGCAC GTGTCACGCT GGAACTCTAC
CGTCCAGATG GCAGTTTCAT CCTCTCGGCA GAAGAAGATG CTGTGCGTGT GGAAACCGCG
AGCGGCATTC GCTACCGGCT GCACATGGAC GGCCCGACCT CCGACGCACG CCTCTGGTCG
CCGGACGATC CGGCGCTTTA CCGGGCTGTG CTGCGCGCCG GTTACGATGA TCGCATCGAA
CTGCGCTTTG GGTTGCGCTC ATTCGACGCG GATGGCGCAA CGCTGCGACT TAACGGCGCA
CCGATCTACC CCCGCATGAT CCTGTCGTGG GGCTGGCGCT GGGAGACCTT CGCCCCCAAT
CCGGGACCGG AACGGGTGCG CGCCGATTTC GAGCGCCTGA AACGCATGGG GTACAACGGC
ATCAAGTGTT GTCTCTGGTT CCCGCCTCGC TACTACTTCG ACCTGGCGGA CGAACTGGGC
ATGCTGTTGT GGATCGAGTT GCCGATGTGG TTGCCTCGCG TAACCGACCA TTTCCGTCGT
CAGACGCCGG TTGAATACGA ACGCCTGGTT CGTCAGGCGC GGCGACATCC ATCGGTAGTC
CTCTACAGCC TGGGATGCGA ACTGGGAAAG GACGTCGGCG CCGATATTCT CGGCTCGCTG
TACGCGATGA CGCGCAGTAT GTGCGGCGAC GCGCTTGTGC GCGATAACAG CGGCTCCGGC
GAAGCCTACG GCGGTCTGCT CAACGAGTTC GCTCAGTACT ACGACTACCA CTTCTATGCC
GACCTGCACT TCTTTCGTGG TCTTCTCGAC GCCTTTTCGC CGCGCTGGCG CCCGGCACAA
CCCTGGTTGT TCGGTGAATA CTGCGATTAC GATACCTTCC GTGATCTGCG GCGCTACCGC
CGGGCGGATG GTTCACGCCC CTGGTGGTTG AGCGCCGATC TGGCGATCAA TCCAACCGGC
GCGCGCTGGA CGAACGAAGC GCCCTTTCTC GAAGAGCGGC TGCGCGCACA GGGGTTGTGG
GAGCGCAGTG CTGAACTCGA AGCATTCTCG TATGCGCATG GCTTGCTTCA CCGCAAATGG
ACAATCGAGA CGACGCGCAT GTATCGTGAG GTTTCCGGCT ATGTCATCAC CGGCGAAGCC
GACACGCCGA TCACCAGCGC TGGCATGTGG GATGCAACCG GCGCCCTCAA GTACGATCCC
ACCGAGTTCC GGCGTTTCAA CAACGATCTG GTCGCACTTA TCGGATGGGA CCGACGACGC
GACTGGGTGC GCGGCGGCGA CCGCGCGGCA TTCTGGGATG TCTGGAGTTA TAACGCCGGA
ACGCTCGTCC GTCCCCACGT GATTGTATCG CACTATGGGG CGCAGGGCGG ACCGGCGCGC
GCTGCCTGGA GCATCGCCTT CGATGACGAA GCGCCCTTTG CGTCTGGTGA TATCGTCGCC
AGCCACGATG TTATGCCCGG CGACGTGCGC GAAATCGGCG TAGCGGAATT TACCGCACCC
GATGTGAGCG CACCACTGCG CGCCATGTTC CGGGTGTCGC TTAATGTCGG AACGCAGCGG
ACGGACAATG CCTGGCCAAT CTGGTTCTTT CCTGCGAACC CGTGGGCAAC CATGCGCAAT
GTGGCGATCT ACGATCCGCT GGGACGGTTG CGCGATCTCA CCCGACTGGC GCCACAGGTG
GTTGAAGTGA CGCACGCCGA TCTGCGGAGC GAAGCCGGAG CGCGCGTCGA TCTGCATGCA
GCGCCGTTCG TCGTTGTCGC CAGCGCCTGG ACACGCGCAC TATCGATGTA TACTCGCGGA
GGCGGGCGTG TCGTGCTGCT CCAGGACGGC GACGGTCCGC CGGGACCGGT TGCAACGGCG
GCTATGCCGT TCTGGCGCGA GGCGCTGCGG GTCTGTGAGC CGCACCCGGC GTGGGGTGAT
TTTCCGCACG ATGGATGGGC TGGTCTCCAA TTCTTTGGCT GCGCCACCGA CTGCGCGCTC
GATACGCAGC CACTCGACGG ACTGACGCGC CCTATTCTGC GCCGTATCGA CACGCGCACC
ACCGCCGTTC ACGACTATGC CGCCGAGGTT GTATGGGGCG AGGGACGATT GATCGTCAGC
ACTCTGCGCA TCTACGGCGG CGCCGGTGAG CAGCCTTCCG GCATCGGGCG CAATACGTCG
GCTGCGTATT TACTGCTGTG CTGGGTGAAG TATCTGGGGA GCAAGAGGTG A
 
Protein sequence
MLSYHMPLPG PWEFRFDDQS DWRPIAVPGC WEDAGFVKDR SGPAWYRTSF VIPHELEGRR 
LFLHFGAVSY HCEAFIVRDS GDMCSIGTHT GMWDAFDLEI GDAAAPGEHV TLLVRVEKPA
SLSDGPESAS LPGRFPLRET LAGFLPYVWG HSHGGIWQEV ALIACGATRF LDAWVHGAPD
GHIFVEAELD GPARVTLELY RPDGSFILSA EEDAVRVETA SGIRYRLHMD GPTSDARLWS
PDDPALYRAV LRAGYDDRIE LRFGLRSFDA DGATLRLNGA PIYPRMILSW GWRWETFAPN
PGPERVRADF ERLKRMGYNG IKCCLWFPPR YYFDLADELG MLLWIELPMW LPRVTDHFRR
QTPVEYERLV RQARRHPSVV LYSLGCELGK DVGADILGSL YAMTRSMCGD ALVRDNSGSG
EAYGGLLNEF AQYYDYHFYA DLHFFRGLLD AFSPRWRPAQ PWLFGEYCDY DTFRDLRRYR
RADGSRPWWL SADLAINPTG ARWTNEAPFL EERLRAQGLW ERSAELEAFS YAHGLLHRKW
TIETTRMYRE VSGYVITGEA DTPITSAGMW DATGALKYDP TEFRRFNNDL VALIGWDRRR
DWVRGGDRAA FWDVWSYNAG TLVRPHVIVS HYGAQGGPAR AAWSIAFDDE APFASGDIVA
SHDVMPGDVR EIGVAEFTAP DVSAPLRAMF RVSLNVGTQR TDNAWPIWFF PANPWATMRN
VAIYDPLGRL RDLTRLAPQV VEVTHADLRS EAGARVDLHA APFVVVASAW TRALSMYTRG
GGRVVLLQDG DGPPGPVATA AMPFWREALR VCEPHPAWGD FPHDGWAGLQ FFGCATDCAL
DTQPLDGLTR PILRRIDTRT TAVHDYAAEV VWGEGRLIVS TLRIYGGAGE QPSGIGRNTS
AAYLLLCWVK YLGSKR