Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RoseRS_4113 |
Symbol | |
ID | 5211096 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus sp. RS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009523 |
Strand | + |
Start bp | 5153447 |
End bp | 5154721 |
Gene Length | 1275 bp |
Protein Length | 424 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640597701 |
Product | glycoside hydrolase family protein |
Protein accession | YP_001278407 |
Protein GI | 148658202 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1486] Alpha-galactosidases/6-phospho-beta-glucosidases, family 4 of glycosyl hydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGATCG TTGTCATCGG CGGCGGCAGC ACCTACACCC CTGAACTGAT CAAAGGTCTG ATCGCCCGGA GTCCCATCCT GAATCTGCAC GAAGTGTGGT TGGTCGATCC TGATGAGGAA CGCCTGCGGA TTGTCGGTTC GTTCGCACAA CGCATGGTCA GCCATGCAAA CGCCGGATTT CGCGTCGAGT TGACCGCCGA CCGGCAGCTG GCGCTGGAAG ACGCCGATTA TGTGGTGACT CAGTTTCGTG TTGGCGGGCA GCAGGCGCGT CGCAATGACG AACTGCTTGG ACGACGGCAT CGTCTTGTCG GGCAGGAGAC GACCGGCGTC GGCGGGTTTG CCAAGGCGCT GCGCACCATT CCGGTGGCGC TCGACATTGC GCGCGATATG CGCGCAATCG CACCGCAGGC GATCCTGCTC AATTTCACAA ACCCGGCAGG TCTTGTCACC GAGGCGGTGG CGCGTCATGG CGGCGTGCCG GTGATCGGGT TGTGCAACAA TGCGATCAAT GCGCAGCGCG CGATTGCCCG CATGGTCAAT GTTCCACCCG AACAGGTGTT CATCGAGCAG GTCGGGCTGA ATCATCTGAA CTGGATCCGT CGCGTGACGA TCAACGGCGA CGATGCGACT GATGCCGTAC TCGAGGCATA TGTCGAGCAT CTGCGCCACG ACGAGGATCC GATCCATTTC CCACCCCGAT TGATCCAGAT GCTGCGCGCC ATTCCTTCGT CGTACCTGCG CTATTTCTAC CTTACCCCGC AGATCATTGC GCAGCAGGAG AGCGGTGCGC CAACCCGCGC CGACGTGGTG ATGGATGTCG AGCGACGGTT GCTCGCGCGC TACGCCGACC CGACGCTGCG CGAGATGCCG CCGGAACTGA TGGAGCGCGG CGGAGCGTAC TACTCCACAG CGGCTGCGGC GCTGATCGAA TCGCTCCACA CCGGCGACAA CGCCATTCAT GTTGTGAATA CGCGCAACAA CGGCGCTATC CCCAACCTGG ACGATGATGT GGTCGTCGAG ATGCCATGCA CGGTCGGGAA GCATGGCGCA ACGCCTATCC CCGTTGCGCC ACTAGAGCCG ATCTTCCATG GTCTGACCTG TCAGGTGAAA GCGTATGAAC TGCTGACCGT GAAAGCGGCG GTCGAGGGCG ACGAGGATGC AGCAATGCTG GCGCTGCTCA CCAACCCGCT CGGACCGGAT GCAGCGCGCG TTGAGACGGT GTGGGAGGAT ATCAAACGAA CGAACCGGGG TTTGCTTCCG ACCTTCGAGA GGTAA
|
Protein sequence | MKIVVIGGGS TYTPELIKGL IARSPILNLH EVWLVDPDEE RLRIVGSFAQ RMVSHANAGF RVELTADRQL ALEDADYVVT QFRVGGQQAR RNDELLGRRH RLVGQETTGV GGFAKALRTI PVALDIARDM RAIAPQAILL NFTNPAGLVT EAVARHGGVP VIGLCNNAIN AQRAIARMVN VPPEQVFIEQ VGLNHLNWIR RVTINGDDAT DAVLEAYVEH LRHDEDPIHF PPRLIQMLRA IPSSYLRYFY LTPQIIAQQE SGAPTRADVV MDVERRLLAR YADPTLREMP PELMERGGAY YSTAAAALIE SLHTGDNAIH VVNTRNNGAI PNLDDDVVVE MPCTVGKHGA TPIPVAPLEP IFHGLTCQVK AYELLTVKAA VEGDEDAAML ALLTNPLGPD AARVETVWED IKRTNRGLLP TFER
|
| |