Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rcas_1932 |
Symbol | |
ID | 5539410 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | + |
Start bp | 2473023 |
End bp | 2474318 |
Gene Length | 1296 bp |
Protein Length | 431 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 640894068 |
Product | glycoside hydrolase family protein |
Protein accession | YP_001432039 |
Protein GI | 156741910 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2723] Beta-glucosidase/6-phospho-beta-glucosidase/beta-galactosidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.536159 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.00000905056 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACGATCC AGTACGATAT TGGTCTTTCC ACCCACACCC GCCAGCACAT CCGCTTGCCT GAACCGCTCC GCTTCCCGCC AGGCTTTCTA TGGGGAACCG CCACCAGCGC GCATCAGGTT GAAGGGCAGA ATACCAATAA CCAGTGGTGG GTTTGGGAGC AGCAAGGGCG CTGTTGGCAT GGTGACGTTT CCGGTGATGC CTGTGGTTGG TGGCGCGACG CTGAAGGCGA CCTGGATCGA GCCGCTGCGC TTGGCACAAA CGCCCATCGT ATGTCTATCG AATGGAGTCG CATCGAGCCG GAAGAAGGAC GCTTCGACCG TCGCGCGATC CGCCGCTATC GCAACATCAT CGGCGGCATC ATCAGGCGCG GGATGACGCC GATGATTACG CTCCACCATT TCACCAATCC GCTCTGGATC GAGGCGCGAG GCGCATGGTT GAACCCGGCA ACGCCCAGAC GGTTTGCGCA ATTCGTCGCG TATGCAGTCG AAGAATTGGG CGATCTCTGT AATCTCTGGT GCACCGTCAA CGAACCGACG GTCTACGCGG CGTTGAGTTA TCTCCAGGGT GTCTGGCCCC CTGGACGGCG CAATATTATC CAGGCGTTGC GGGTCTTCGC CAATTTGATG CGCGGGCACG AACTAGCGGC GCAGACGGTG CGCAAGCAGC ATCCGGCGCA CCGTGTTGGC ATTGTGCATC ACAAGCGGGT CCTTGATCCG GCTTCGCCTG CCGGTCACGA TGTGCTGACG ACGGTGATGT ATGATTATCT GGTCAATGGA CTGGTGTTGC GGCGGCTGCG CGAAACGTCC GATTTCTTCG GGCTGAATTA CTATAGCCGC GATCACATCG CCTTTGACCT GCGCCGTCCG TACCATCTGT TCATTCGCCG CTTCACGCCG CCGCACTTTG AGCAGAGCGA CGCGGGCATG GAGGGCGCGT TTGGCGAAAT CTATCCCAAC GGTCTGTACC GCGCGCTCAA ACGGGTCTAC CGCTGGCTGA AACTACCGAT CTATGTGACC GAAACCGGTC TGCCGGATGC GGACGATAAT CAGCGCCCGC GCTTTCTGCT CAATCACCTG GAATCGGTGC ATCGCGCGAT CCAGGAGGGT GTCGATGTGC GTGGCGTCTT CGTCTGGTCG CTGGTGGATA ATTTTGAGTG GGCAGAGGGA TGGGGGCTGC GCTTCGGACT GTATGCGCTC GATGAGCGCA CCGGTGAGCG GCGGATGCGT CCTTCCGCTG CGCTCTACGC GATCATCACC CGCGCAAATG CGATTCCTGC GCCGGGGGCG TTGTGA
|
Protein sequence | MTIQYDIGLS THTRQHIRLP EPLRFPPGFL WGTATSAHQV EGQNTNNQWW VWEQQGRCWH GDVSGDACGW WRDAEGDLDR AAALGTNAHR MSIEWSRIEP EEGRFDRRAI RRYRNIIGGI IRRGMTPMIT LHHFTNPLWI EARGAWLNPA TPRRFAQFVA YAVEELGDLC NLWCTVNEPT VYAALSYLQG VWPPGRRNII QALRVFANLM RGHELAAQTV RKQHPAHRVG IVHHKRVLDP ASPAGHDVLT TVMYDYLVNG LVLRRLRETS DFFGLNYYSR DHIAFDLRRP YHLFIRRFTP PHFEQSDAGM EGAFGEIYPN GLYRALKRVY RWLKLPIYVT ETGLPDADDN QRPRFLLNHL ESVHRAIQEG VDVRGVFVWS LVDNFEWAEG WGLRFGLYAL DERTGERRMR PSAALYAIIT RANAIPAPGA L
|
| |