Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rcas_0919 |
Symbol | |
ID | 5538385 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | + |
Start bp | 1203566 |
End bp | 1206310 |
Gene Length | 2745 bp |
Protein Length | 914 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640893069 |
Product | glycoside hydrolase family 3 protein |
Protein accession | YP_001431052 |
Protein GI | 156740923 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1472] Beta-glucosidase-related glycosidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTGACA CGGGAGTCGC TCTCACCAAC GACGTGGAGG CGCGCATCGA AGCGCTGCTG CGACAGATGA CGCTGGCGGA GAAGGTCGCG CTGATGGCAG GATCGAGCAT GTGGACGACC ACTCCCATCG AGCGGTTGGG GATTCCTGCG ATCAAAGTCA CGGACGGACC GAACGGCGCG CGGGGTGCAG GTGGATTCGT CGGTGGCGCC GTTACGGCAG CGTGCTTCCC TGTAGGAATT GCGCTGGCCG CGACATGGAA CAGCAGGCTG GTGGAAGAGG TTGGCGAGGC GCTCGCCGAA GAAGCGCAAT CCAAAGGCGC TCGCCTCCTG CTGGCGCCGA CCGTTAACAT CCATCGTTCG CCGCTCAATG GGCGCAACTT CGAGTGCTAT TCCGAAGACC CGTATCTCTC GGCGCGCATG GCGGTCGCCT ATATCACCGG GTTGCAGCGG CGCGGCGTTG GCGCGACGAT CAAACACTAC GTCTGCAACG ACTCGGAGTT CGAGCGGAAC ACGATCAGTT CTGAGGTCGA TGAACGCACA TTGCGCGAGA TCTATCTGCC TCCCTTTCGC GCTGCCGTGC AGGAGGCGAA AACCTGGGCG GTCATGGCGG CGTACAATCG TGTCAATGGG GTGTATGCCA GCGAGCATCC GGTATTGCTC AACGATATCC TGAAGCGCGA ATGGGGATTC GATGGCATTG TGATGTCCGA CTGGTTCGGC ACGAAGAGCG TCGTCGAGGC TGCCGCCAAC GGGCTGGACC TTGAAATGCC GGGACCAACG CGCTGGCGCG GTGAGCGATT AGTCGCCGCC GTCGAGAATG GTCAGGTGCG TATGGAAGCC ATCGATGAGT CGGCTTGTCG AATATTGCGC ACGATTGCGC GCGCGGGGGC GTTCGAGACA CCGGAGATTC CCCCTGAGCA GGCGATTGAT CGCCCTGAGC ACCGGGCGCT GATCCGCCGT GCTGCCGCCG AGAGCATGGT GCTGCTCAAG AACGATGGCG GCATCCTGCC GCTCAATCTG GCGAACCTGT CGTCGATTGC GATCATCGGA CCCAACGCGA AGACGGCACA GATCATGGGT GGCGGGAGCG CACAGGTCAA CGCGCACTAC GCCATTTCGC CCTACGACGG CATTGCGGCG CGAGTCGGCG GGCAGGTGAT CCTGGAGTAC GAGATCGGTT GCACGAACCA TCGACACCTT CCGCGCTTCG ATAGCCGATT GGTGACGCCG GAGAGCGGCG AGGGGCGCGG CTTTACCGTC GCTTACTACA ACACCCACGA CCTGTCCGGT GAGCCGGTTC ATCAGGCGGC GACCGAGAGC AGCGAGCAGG TCTGGCTAGG GGAGGTGGCG CCGGGCGTCG ATCCACGCCA GTTTTCGGCG CGTTTCACCG CGCGGTTTAC GCCCGCCGAA CGTGGAACGC ATACCTTTAG CCTGATCAGC GCCGGGCTGA GTCGCCTCTT TGTCGACGAC GTACTGATCA TCGACAACTG GACGACCCAG ACACGCGGGG ATGCGTTCTT TGGCGCAGGA AGCGCCGAAG CGACGGCGCC GATGACGCTG GAAGCCGGTC GAACCTACGC GCTTCGCCTG GAATATAGTA ATCAGGGCGC GACCATGCTT GCCGCTGTGC GGCTCGGCTA TCTGCCGCCG GTTGCAGAAG ACGCCATCGA GCGCGCCGCA GCCCTGGCGG CGCAATCCGA CGTTGCGCTG GTGTTTGTCG GGCTGAATGC CGATTGGGAG AGCGAAGGGT ATGATCGCCC GCATATGGAC CTGGTCGGCA GGCAGGACGA ACTGGTCGAG CGCGTGGCAG CCGCCAATCC GCGCACGATT GTCGTGCTGC AAACCGGTTC GCCGGTGACG ATGCCGTGGC TGGATCGGGT GGCGGCGGTC CTTCAGGCGT GGTATCCCGG TCAGGAATGC GGTAACGCGA TTGCCGACGT GTTGTTTGGC GATGTTAACC CCTCGGGCAG ACTGCCGCAG ACTTTTCCGG TTCGATTGGA AGACAATCCG GCATACATCA ACTATCCCGG TGAGAACGGG CGGGTGCGCT ACGGTGAAGG TATCTTCGTC GGCTACCGCT ACTACGAGAA GAAAAAGGTT GCGCCGCTGT TTCCCTTTGG CTTCGGTCTT TCGTATACCA CGTTCCGCTA CGATAACCTG CGCCTGAGCG CCGATGTCAT TGCTCCCGAT GATCGGCTCA CGGCGCAGAT CGACATCACC AACACCGGGA TGGTCGCCGG TCAGGAAGTG GTGCAACTGT ACGTGCGCGA CAGCGCCGCG CGCGTCGCCC GACCGGCAAA GGAGTTGAAA GGGTTTGTCA AAGTCGCGCT GCAACCGGGC GAGACACAAA CGGTGACCTT CTCGCTTGAT CGGGAGGCGC TGGCGTACTG GGACGACGTC CAGCATGCCT GGGTCGCCGA GGCAGGTGAG TTCGAGGTTC TTGTGGGAAG TTCATCGCAG GACATCCGGG CGCGCGCGGT GTTTCATCTG AATGATACTG TCGCCTTCGG CGGACCGACA AAGTCGCCGG TGCAACTGAG TGTCGACTCG CCGGTCAAGG CGTTGATCGA ACACGACGGT GCGCGTGCAG TGCTGGAACG CCACATGCCC GGTTTTGTCG AACAGGCTGG CGTCGGTGTC ATGATGGGGC TGACGCTGGC GCAGATGGCA GCATTCGCAG CGGATCGGAT CACGCCGGAA CTGTTGGGCG CGATTGCCGC AGACCTGGCG CGGATTCAGG CATGA
|
Protein sequence | MTDTGVALTN DVEARIEALL RQMTLAEKVA LMAGSSMWTT TPIERLGIPA IKVTDGPNGA RGAGGFVGGA VTAACFPVGI ALAATWNSRL VEEVGEALAE EAQSKGARLL LAPTVNIHRS PLNGRNFECY SEDPYLSARM AVAYITGLQR RGVGATIKHY VCNDSEFERN TISSEVDERT LREIYLPPFR AAVQEAKTWA VMAAYNRVNG VYASEHPVLL NDILKREWGF DGIVMSDWFG TKSVVEAAAN GLDLEMPGPT RWRGERLVAA VENGQVRMEA IDESACRILR TIARAGAFET PEIPPEQAID RPEHRALIRR AAAESMVLLK NDGGILPLNL ANLSSIAIIG PNAKTAQIMG GGSAQVNAHY AISPYDGIAA RVGGQVILEY EIGCTNHRHL PRFDSRLVTP ESGEGRGFTV AYYNTHDLSG EPVHQAATES SEQVWLGEVA PGVDPRQFSA RFTARFTPAE RGTHTFSLIS AGLSRLFVDD VLIIDNWTTQ TRGDAFFGAG SAEATAPMTL EAGRTYALRL EYSNQGATML AAVRLGYLPP VAEDAIERAA ALAAQSDVAL VFVGLNADWE SEGYDRPHMD LVGRQDELVE RVAAANPRTI VVLQTGSPVT MPWLDRVAAV LQAWYPGQEC GNAIADVLFG DVNPSGRLPQ TFPVRLEDNP AYINYPGENG RVRYGEGIFV GYRYYEKKKV APLFPFGFGL SYTTFRYDNL RLSADVIAPD DRLTAQIDIT NTGMVAGQEV VQLYVRDSAA RVARPAKELK GFVKVALQPG ETQTVTFSLD REALAYWDDV QHAWVAEAGE FEVLVGSSSQ DIRARAVFHL NDTVAFGGPT KSPVQLSVDS PVKALIEHDG ARAVLERHMP GFVEQAGVGV MMGLTLAQMA AFAADRITPE LLGAIAADLA RIQA
|
| |