Gene Rcas_1932 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_1932 
Symbol 
ID5539410 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp2473023 
End bp2474318 
Gene Length1296 bp 
Protein Length431 aa 
Translation table11 
GC content60% 
IMG OID640894068 
Productglycoside hydrolase family protein 
Protein accessionYP_001432039 
Protein GI156741910 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2723] Beta-glucosidase/6-phospho-beta-glucosidase/beta-galactosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.536159 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00000905056 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACGATCC AGTACGATAT TGGTCTTTCC ACCCACACCC GCCAGCACAT CCGCTTGCCT 
GAACCGCTCC GCTTCCCGCC AGGCTTTCTA TGGGGAACCG CCACCAGCGC GCATCAGGTT
GAAGGGCAGA ATACCAATAA CCAGTGGTGG GTTTGGGAGC AGCAAGGGCG CTGTTGGCAT
GGTGACGTTT CCGGTGATGC CTGTGGTTGG TGGCGCGACG CTGAAGGCGA CCTGGATCGA
GCCGCTGCGC TTGGCACAAA CGCCCATCGT ATGTCTATCG AATGGAGTCG CATCGAGCCG
GAAGAAGGAC GCTTCGACCG TCGCGCGATC CGCCGCTATC GCAACATCAT CGGCGGCATC
ATCAGGCGCG GGATGACGCC GATGATTACG CTCCACCATT TCACCAATCC GCTCTGGATC
GAGGCGCGAG GCGCATGGTT GAACCCGGCA ACGCCCAGAC GGTTTGCGCA ATTCGTCGCG
TATGCAGTCG AAGAATTGGG CGATCTCTGT AATCTCTGGT GCACCGTCAA CGAACCGACG
GTCTACGCGG CGTTGAGTTA TCTCCAGGGT GTCTGGCCCC CTGGACGGCG CAATATTATC
CAGGCGTTGC GGGTCTTCGC CAATTTGATG CGCGGGCACG AACTAGCGGC GCAGACGGTG
CGCAAGCAGC ATCCGGCGCA CCGTGTTGGC ATTGTGCATC ACAAGCGGGT CCTTGATCCG
GCTTCGCCTG CCGGTCACGA TGTGCTGACG ACGGTGATGT ATGATTATCT GGTCAATGGA
CTGGTGTTGC GGCGGCTGCG CGAAACGTCC GATTTCTTCG GGCTGAATTA CTATAGCCGC
GATCACATCG CCTTTGACCT GCGCCGTCCG TACCATCTGT TCATTCGCCG CTTCACGCCG
CCGCACTTTG AGCAGAGCGA CGCGGGCATG GAGGGCGCGT TTGGCGAAAT CTATCCCAAC
GGTCTGTACC GCGCGCTCAA ACGGGTCTAC CGCTGGCTGA AACTACCGAT CTATGTGACC
GAAACCGGTC TGCCGGATGC GGACGATAAT CAGCGCCCGC GCTTTCTGCT CAATCACCTG
GAATCGGTGC ATCGCGCGAT CCAGGAGGGT GTCGATGTGC GTGGCGTCTT CGTCTGGTCG
CTGGTGGATA ATTTTGAGTG GGCAGAGGGA TGGGGGCTGC GCTTCGGACT GTATGCGCTC
GATGAGCGCA CCGGTGAGCG GCGGATGCGT CCTTCCGCTG CGCTCTACGC GATCATCACC
CGCGCAAATG CGATTCCTGC GCCGGGGGCG TTGTGA
 
Protein sequence
MTIQYDIGLS THTRQHIRLP EPLRFPPGFL WGTATSAHQV EGQNTNNQWW VWEQQGRCWH 
GDVSGDACGW WRDAEGDLDR AAALGTNAHR MSIEWSRIEP EEGRFDRRAI RRYRNIIGGI
IRRGMTPMIT LHHFTNPLWI EARGAWLNPA TPRRFAQFVA YAVEELGDLC NLWCTVNEPT
VYAALSYLQG VWPPGRRNII QALRVFANLM RGHELAAQTV RKQHPAHRVG IVHHKRVLDP
ASPAGHDVLT TVMYDYLVNG LVLRRLRETS DFFGLNYYSR DHIAFDLRRP YHLFIRRFTP
PHFEQSDAGM EGAFGEIYPN GLYRALKRVY RWLKLPIYVT ETGLPDADDN QRPRFLLNHL
ESVHRAIQEG VDVRGVFVWS LVDNFEWAEG WGLRFGLYAL DERTGERRMR PSAALYAIIT
RANAIPAPGA L