Gene Information |
Locus tag | Rcas_1385 |
Symbol | |
ID | 5538858 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | - |
Start bp | 1772144 |
End bp | 1773418 |
Gene Length | 1275 bp |
Protein Length | 424 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640893523 |
Product | glycoside hydrolase family protein |
Protein accession | YP_001431499 |
Protein GI | 156741370 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1486] Alpha-galactosidases/6-phospho-beta-glucosidases, family 4 of glycosyl hydrolases |
TIGRFAM ID | |
|
|
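The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions on the replicon, and that the CDS ends with a single stop codon that is not counted in the protein length. Neither convention is stated in the record, but both are consistent with the numbers shown. The function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 1772144
END_BP = 1773418
GENE_LENGTH_BP = 1275
PROTEIN_LENGTH_AA = 424


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length: int) -> int:
    """Residues encoded by an unspliced CDS that ends with one stop codon.

    Assumption: the stop codon is part of the CDS but not of the protein.
    """
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))            # compare with Gene Length
    print(expected_protein_length(GENE_LENGTH_BP))  # compare with Protein Length
```

Under these assumptions, 1773418 - 1772144 + 1 gives the 1275 bp gene length, and 1275 / 3 - 1 gives the 424 aa protein length reported in the table.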
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.382298 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 42 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAATATCG TTGTCATTGG CGGCGGCAGC ACCTATACCC CCGAACTCAT CAACGGTCTG ATCACCCGGA GTGCAACCCT GCACGTGCGC ACGGTCTGGC TTGTCGATCC CGACGAGGAG CGCCTGCACA TCGTTGGTTC ATTTGTGCAG CGCATGGTTC GTCACGCCGG CGCAGGATTT CAAGTGGAGT TGACCACGGA ACGACGCCGG GCGCTCGAAG GCGCCGATTA TGTCATCACG CAGTTTCGCG TCGGCGGGCA GCAGGCGCGT CATAACGACG AACTGCTCGG ACGGCGGCAT CACCTCGTCG GGCAGGAGAC GACCGGCGTT GGCGGGTTTG CCAAGGCGTT GCGCACTATT CCCATTGTGC TCGATGTTGC GCGTGATATG CGTGCGAACG CACCGCAGGC GATTCTGCTC AATTTCACCA ATCCGGCTGG CATCGTCACC GAGGCGGTAG CGCGTCACGG CGGCGTGCCG GTCATTGGGC TGTGCAACAA TGCGATCAAT GCGCAGCGCG GCATTGCGCG CATGTGCGAT GTGCCGCCGG AACAGGTGTT CATCGAGCAG GTTGGACTGA ACCACCTGAA CTGGATCCGG CGGGTGACGA TCAATGGCGA GGACGCGACC AACGCCGTGA TCGCGGCGTA TGTCGAGCGT CTGCGCCACG ATGACGATCC GCTCCGTTTT CCACCCCGGC TCATTCACAT ACTGCGCGCC ATTCCGTCGT CATATCTGCG CTATTTCTAT CTTACGCCGC AGATCATTGC GCGGCAGAAC AGCGGCGAGC CGACCCGCGC CGAAGTGGTG ATGGAAGTCG AGCGCCGACT GCTTGCGCGC TACGCCGACC CGACGCTGCG TGAGATGCCG CCGGAACTGA TGGAACGCGG CGGCGCGTAC TACTCGACGG CGGCTGCGGC GCTGATCGAA TCGCTCTATA CCGACGACAA CGCCATTCAC GTGGTGAATA CGCGCAACAA TGGCGCTATC CCCAACCTCG CCGACGATGT GGTCGTTGAA ATGCCATGTG CGGTTGGGAA ATGCGGCGCC ACGCCCATTC CCGTTGCTCC GCTCGAGCCA GCCTTCCACG GGCTGACCTG CCAGGTGAAA GCCTATGAAC TGCTCACCGT GCAGGCAGCC GTCGAGGGGA ACGAAGAAGC AGCGATGCTG GCGTTACTTG CCAACCCGCT CGGTCCCGAC GCGGCACACG TTGAAGCCGT TTGGGAGGAC ATCAAACGAA CGAATCGCGG TCTGCTTCCC ACTTTCGAGA GGTAA
|
Protein sequence | MNIVVIGGGS TYTPELINGL ITRSATLHVR TVWLVDPDEE RLHIVGSFVQ RMVRHAGAGF QVELTTERRR ALEGADYVIT QFRVGGQQAR HNDELLGRRH HLVGQETTGV GGFAKALRTI PIVLDVARDM RANAPQAILL NFTNPAGIVT EAVARHGGVP VIGLCNNAIN AQRGIARMCD VPPEQVFIEQ VGLNHLNWIR RVTINGEDAT NAVIAAYVER LRHDDDPLRF PPRLIHILRA IPSSYLRYFY LTPQIIARQN SGEPTRAEVV MEVERRLLAR YADPTLREMP PELMERGGAY YSTAAAALIE SLYTDDNAIH VVNTRNNGAI PNLADDVVVE MPCAVGKCGA TPIPVAPLEP AFHGLTCQVK AYELLTVQAA VEGNEEAAML ALLANPLGPD AAHVEAVWED IKRTNRGLLP TFER
|
| |
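The protein sequence follows from the gene sequence under translation table 11 (the NCBI bacterial, archaeal and plant plastid code). The minimal sketch below shows how the two rows relate. It assumes the gene sequence is listed in coding orientation (it begins with GTG and ends with TAA, although the gene lies on the minus strand), and that an alternative start codon such as GTG is read as methionine at the first position. The helper names `translate_cds` and `gc_fraction` are hypothetical and not part of any database tool. Only short fragments of the sequence are used here.

```python
from itertools import product

# Codon-to-amino-acid assignments in TCAG order. Table 11 shares these
# assignments with the standard code and differs in its start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Start codons listed for translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Remove the whitespace used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate_cds(seq: str, first_is_start: bool = True) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and first_is_start and codon in START_CODONS:
            aa = "M"  # an initiator codon is read as methionine
        else:
            aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a sequence."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


if __name__ == "__main__":
    # First three blocks of the gene sequence.
    print(translate_cds("GTGAATATCG TTGTCATTGG CGGCGGCAGC"))
    # Last fifteen bases, an internal fragment, so no start-codon rule.
    print(translate_cds("ACTTTCGAGA GGTAA", first_is_start=False))
```

Applying `gc_fraction` to the full gene sequence is how the GC content field could be recomputed. The value in the record is rounded to a whole percent.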