Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rcas_1884 |
Symbol | |
ID | 5539362 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | + |
Start bp | 2418834 |
End bp | 2420240 |
Gene Length | 1407 bp |
Protein Length | 468 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 640894021 |
Product | glycoside hydrolase family protein |
Protein accession | YP_001431992 |
Protein GI | 156741863 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1486] Alpha-galactosidases/6-phospho-beta-glucosidases, family 4 of glycosyl hydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.0104687 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGCCAA AGATTGTCTT CATCGGCGCC GGCAGCACGG TGTTCGCCAA AAACTTGATG GGCGACATCC TGAGCTTCCC CGAACTTGCC AACGCCACGC TGACGTTGTT CGACATTGAC CCGGAGCGCT TGCGTACATC CGAAGTCGTG GCGCACAAGG TCGCTGCGGC GCTCGACGCG CGCCCGACGA TTGAGGCGAC GACCGACCGA CGCCGCGCGC TCGATGGCGC CGATTATGCC ATTTGCATGA TCCAGGTCGG CGGTTATAAG CCGTGTACGG TGACCGATTT CGAGATCCCG AAGAAGTATG GTCTGCGCCA GACGATTGCC GATACCCTGG GCATCGGCGG GATTATGCGC GGGTTGCGCA CCATTCCAGT GTTGCTCTCG ATATGCCGCG ATATGGAAGA GGTGTGTCCC GATGTGACGT TCTTGCAGTA TGTCAATCCA ATGGCCATGA ATTGCTGGGC GATCAGTCGC GCCAGCACGA TTAAGACGGT CGGGTTGTGC CACAGTGTGC AGGGCACTGC CGAGCAACTG GCGCACGACA TTGGCGTGCC GGTAGAGGAG ATCAACTATG TCTGCGCTGG CATCAACCAT ATGGCGTTCT ACCTGCGCTT CGAGCGAAAC GGCGAAGACC TCTATCCGCT CATCCGCAAG GTCTACGACG AAGGGCGCGT GCCGGCGTGG AATCGGGTGC GCTACGAAGT GTTCCGACGC CTTGGCTATT TTGTGACCGA GTCGAGCGAG CACTTCAGTG AATACGTGCC CTGGTTCATC AAGCGCGACC GACCCGATCT GATCGAGCGC TTTAATATTC CGCTCGATGA ATACATCCGC CGTTGCGAAG TGCAACTGGC GGGCTGGGAG TTCGTGCAGT ACAAACTCGA TCATCCAGAG ATTGCGAGTT CCGATCTGCG CGCTGCGATG CGCGAACGTT CCCGCACCAA TCCGTACCTG ACGGGCGAGT TCATCGACTA TGTCATCACA TCTGCCGAAG ACCTCGAGGC AGGGAAGGTC GAGCGCAGCC ACGAGTATGG CTCGCTGATC ATCCACAGCC TGGAAACGGG GCAACCGCGC GTGGTCTACG GAAATGTGCC TAACTATGGG CTGATCGATA ATCTCCCACA GGGGTGCTGC GTCGAAGTTC CGTGTCTGGT GGACAAAAAC GGCGTTCAGC CGACAAAGAT CGGCGCACTG CCGCCGCACC TCGCTGCGCT CATGCAAACG AACATCAACG TGCAGGCGTT GACCGTCGAA GCGGCGCTGA CCGGCAAACG CGAGCATATC TACCACGCGG CGATGCTCGA TCCGCACACT GCCGCTGAAC TCGATCTGGA TCAGATCTGG GCATTGGTGG ATGACTTGAT CGCCGCCCAC GGCGACTGGT TGCCGGAGTA TCGGTAG
|
Protein sequence | MPPKIVFIGA GSTVFAKNLM GDILSFPELA NATLTLFDID PERLRTSEVV AHKVAAALDA RPTIEATTDR RRALDGADYA ICMIQVGGYK PCTVTDFEIP KKYGLRQTIA DTLGIGGIMR GLRTIPVLLS ICRDMEEVCP DVTFLQYVNP MAMNCWAISR ASTIKTVGLC HSVQGTAEQL AHDIGVPVEE INYVCAGINH MAFYLRFERN GEDLYPLIRK VYDEGRVPAW NRVRYEVFRR LGYFVTESSE HFSEYVPWFI KRDRPDLIER FNIPLDEYIR RCEVQLAGWE FVQYKLDHPE IASSDLRAAM RERSRTNPYL TGEFIDYVIT SAEDLEAGKV ERSHEYGSLI IHSLETGQPR VVYGNVPNYG LIDNLPQGCC VEVPCLVDKN GVQPTKIGAL PPHLAALMQT NINVQALTVE AALTGKREHI YHAAMLDPHT AAELDLDQIW ALVDDLIAAH GDWLPEYR
|
| |