Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RoseRS_0991 |
Symbol | |
ID | 5207937 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus sp. RS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009523 |
Strand | - |
Start bp | 1213856 |
End bp | 1216600 |
Gene Length | 2745 bp |
Protein Length | 914 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640594605 |
Product | glycoside hydrolase family protein |
Protein accession | YP_001275350 |
Protein GI | 148655145 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3250] Beta-galactosidase/beta-glucuronidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.957169 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTTTCCT ATCACATGCC GCTTCCCGGT CCCTGGGAAT TTCGTTTCGA CGATCAATCC GATTGGACGC CGATCACCGT TCCTGGCTGT TGGGAAGATG CCGGTTTCCC GAAGGATCGC GCCGGTCCGG CGTGGTATCG CACGACATTC GTTCTTCCGC AAGAACTGGC AGGACGTCGT CTCTTCCTGC GCTTTGGCGC CGTCAGTTAC CATTGTGAAG CATTCATTGC GCGCGATTCG GGCGAGATGC GCAGTATTGG CACACACACC GGCATGTGGG ACACATTCGA CCTGGCGCTT GGCGATGCAG CGGCAGCCGG TGAGTGCGTC ACACTGCTGG TGCGCGTTGA AAAACCGGCA AGCCTGACGG GAGGACCGGA GTCGCCGTCG CAGCCGGGGC GCTTCCCACT GCGCGAGACC CTCGCCGGTT TTCTGCCGTA TGTCTGGGGG CATAGCCACG GTGGAATCTG GCAGGAGGTT GCGCTGGTCG CTGCTGGCGG AACCCGCTTC CTCGATGCAT GGGTGCGCGG TTCAGCCGAT GGGTATGTCT TCGTGGAAGC AGAGCTCGAC GGTCCGGCGC GGGTAATGCT GGAGTTGTAC CGACCCGACG GCGGGTTCAT TCTGTCTGCC GAAGAAGATG CCGTGCGTGA GGACACGCCG ACTGGAGAAC GCTACATGCT GCGGATGAAC GGTCCTATTC CTGATCCGCG CCCCTGGTCG CCCGCCGACC CCGCGCTCTA CCGCGCTGTG CTGCGCGCTG GTCACGACGA CCGGGTTGAA CTGCGCTTCG GGTTGCGCTC GTTCGAGGCG GACGGAACAA TGCTGCGGCT CAACGACAAA CCAATCTACC CGCGCATGAT CCTGTCGTGG GGATGGCGCT GGGAAACATT CGCCCCCAAT CCCGGCGTGG AACGGGTGCG CGTCGATTTC GAGCGCCTGA AGCGCCTGGG GTACAACGGC ATCAAGTGCT GCCTGTGGTT CCCGCCGCAG TACTACTTCG ATCTGGCGGA CGAAATGGGC ATGCTGGTGT GGGTCGAGTT GCCGATGTGG CTGCCCCGCG TGACCGACCA CTTTCGCCAG CAGGTGTACG TCGAATACGA ACGCCTGATC CGCCAGGCGC GCCGTCATCC GTCGGTCGTG ATCTACAGCC TGGGGTGTGA ACTGGGAAAG GACGTCGGCG CCGACATCCT TGGCTCGCTG TACGCAATGA CGCGCAGTAT GTGCGGCGAG GCGCTGGTGC GCGATAACAG CGGTTCAGGC GAGGCGTATG GCGGCTTGCT CAACGAGTTT GCTCAGTACT ACGATTACCA TTTCTATGCC GACCTGCACT TCTTTCGCGG TCTTCTCGAC GCTTTTACAC CGCGCTGGCG CGGGGCGCAC CCCTGGCTGT TCGGCGAATA CTGCGACTAT GACACCTTTC GTGATCTGCG GCGCTACCGC AACGTTGATG GTTCGCGTCC CTGGTGGTTG AGCGCCGATG AGACGATCAA CCCGACCGGT GCACGCTGGA TGAACGAAGC GCCCTTCCTT GAGGAACGGC TGCGCGCCCA GGGATTGTGG GAGCGCAGCG CCGAACTCGA AGCCATTTCC TACGCCCATG GACTGCTTCA CCGCAAGTGG ACGGTCGAAC TGACACGCGC CTACCGCGAA ATTTCCGGCT ATGTCATCAC CGGTGAAGCG GATACGCCGA TCTCCAGCGC CGGTATGTGG GACGCAACCG GAGCGCTCAA ATACGATCCC ACCGAATTTC GACGCTTCAA CGGCGACCTG GTGGCGCTCA TCGGGTGGGA TCGGCGGCGC GATTGGGTGC GCGGCGGCGA TCGCGTGGCG TGGTGGGATG TCTGGAGTTA CACCGCCGGC GCGCTCGTCC GTCCACATCT GATCGCGTCG CACTACGGCG CGGAGAGCGG GCCGGCGCGC GCGGTCTGGA GCATTGCCTT CGACGACAGA CCGCCGTTCG CGTCCGGTGA AGTAACGACC GACCGCGACC TGATGCCGGG AGAGGTGCGC GAAATCGGCG TGGCGGAGTT CACTGCGCCC GACGTGAGCG CGCCACTGCG TGCAACCTTG CAGGTAACGC TGAGCGTCGG CGCCCAACAG ACAGAAAATG CGTGGTCACT CTGGTTCTTC CCCGCCGATT TCTGGGCATC CGTGCAGCAT GTCGCACTCT ACGATCCGAT AGGACGGCTG CGCGACCTGG CGCGTCTTGC GCCGCAGGCA GTGGAAGTCC TGCATGGCGA TCTGCGGGGT GACGCGGGAG CGCGCATCAA TCTGCGCGGA TCGACATTTG TTGTTGTCGC CAGCGCCTGG ACGCGCGCAT TATCGGCGTA TACCCGATCC GGAGGGCGTG TTGTGCTGAT CCAGGACGGC GACGGTCCTC CAGGACCGGT TGCAACGGCG GCGATGCCGT TCTGGCGCGA AGCGTTGCGA CTCTGTGAGC CGCATCCGGC GTGGGGCGAC TTCCCACACG ATGGATGGGC CGGATTGCAG TTCTTTGGGT GCGCAACCGA CCATGCGCTC GATACGCAAC CGCTCGGCGG GTTGAGCCGC CCGATCCTGC GCCGGATCGA CACCCGGACG GCGGCGGTTC ACGATTATGC TGCTGAACTG ACGTGGGGCG ATGGACGGGT GATCGTCAGC ACCCTTCGTC TCTACGGCGG CGCAGGTGAA CAACCTTCCG GCATCGCCCG CAACACCGCA GCCGCGTACC TGCTGCTGTG CTGGGTGCGG TATCTGCAAG GTTGA
|
Protein sequence | MLSYHMPLPG PWEFRFDDQS DWTPITVPGC WEDAGFPKDR AGPAWYRTTF VLPQELAGRR LFLRFGAVSY HCEAFIARDS GEMRSIGTHT GMWDTFDLAL GDAAAAGECV TLLVRVEKPA SLTGGPESPS QPGRFPLRET LAGFLPYVWG HSHGGIWQEV ALVAAGGTRF LDAWVRGSAD GYVFVEAELD GPARVMLELY RPDGGFILSA EEDAVREDTP TGERYMLRMN GPIPDPRPWS PADPALYRAV LRAGHDDRVE LRFGLRSFEA DGTMLRLNDK PIYPRMILSW GWRWETFAPN PGVERVRVDF ERLKRLGYNG IKCCLWFPPQ YYFDLADEMG MLVWVELPMW LPRVTDHFRQ QVYVEYERLI RQARRHPSVV IYSLGCELGK DVGADILGSL YAMTRSMCGE ALVRDNSGSG EAYGGLLNEF AQYYDYHFYA DLHFFRGLLD AFTPRWRGAH PWLFGEYCDY DTFRDLRRYR NVDGSRPWWL SADETINPTG ARWMNEAPFL EERLRAQGLW ERSAELEAIS YAHGLLHRKW TVELTRAYRE ISGYVITGEA DTPISSAGMW DATGALKYDP TEFRRFNGDL VALIGWDRRR DWVRGGDRVA WWDVWSYTAG ALVRPHLIAS HYGAESGPAR AVWSIAFDDR PPFASGEVTT DRDLMPGEVR EIGVAEFTAP DVSAPLRATL QVTLSVGAQQ TENAWSLWFF PADFWASVQH VALYDPIGRL RDLARLAPQA VEVLHGDLRG DAGARINLRG STFVVVASAW TRALSAYTRS GGRVVLIQDG DGPPGPVATA AMPFWREALR LCEPHPAWGD FPHDGWAGLQ FFGCATDHAL DTQPLGGLSR PILRRIDTRT AAVHDYAAEL TWGDGRVIVS TLRLYGGAGE QPSGIARNTA AAYLLLCWVR YLQG
|
| |