Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rcas_4246 |
Symbol | |
ID | 5541757 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | - |
Start bp | 5488068 |
End bp | 5489849 |
Gene Length | 1782 bp |
Protein Length | 593 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640896353 |
Product | alpha amylase catalytic region |
Protein accession | YP_001434291 |
Protein GI | 156744162 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0366] Glycosidases |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.286976 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAGCAC GCGCACTCTT TCTGATCATC GTTCTTATCA TCGTTGGGTG CGGTGCGCCT TCTGCGGCAG TTCCGCCAAC TGCCACACCG CCTGCCGCAC CTTCCGCTAC GCCGTTAAGC GAAGAAGCAG CGCTGCTCGC CACCATGGCG GCGAATGCGC CCTCTCCCGC AGAGGCGACC CCCACCGTCC CACTGCCGAC CCTGTTTCCG ACTATCACCC TCGGACCGAC TGCGGAACCG CGACCGTTGC CGGCCGGATG GTGGGACACC GCCGTGTGCT ATGAAATCTT CGTGCGCTCG TTCTACGACA GCGACGGCGA TGGGATCGGC GATATCAACG GGCTGATCGC AAAACTCGAC TACATCAACG ATGGCGATCC AACCGGCGGT TCCGACCTTG GCGCCAACTG TATCTGGCTG ATGCCGGTGG CGGAAGCCGC CAGCTACCAC GGTTACGATG TCATCGACTA TAACGCCATC GAAAAAGATT ATGGCACAAA CGACGATTTC AAGCGTCTTG TGGCAGAGGC GAACCGGCGC GGCATCCGGG TGATCGTCGA TCTAGTGTTG AACCACACAT CCAGCGCGCA TCCCTGGTTT ATCTCGGCGC TGAACGATCC TGCATCTCCC TATCGCGACT GGTACATCTG GTCGCCGGTC GATCCCGGCT ACCGCGGACC GTGGGGGCAA CAGGTCTGGC ACCGTTCACC GGCGCGCAAC GAGTACTACT ATGGCGTTTT CGTCGCCGAA ATGCCCGATC TGAACTACCG CAACCCGGAA GTCGTCGCTG AAGCGGAGAA GATCGCCGCC TTCTGGCTGA ACGAGATGGG AGTCGATGGG TTCCGCCTCG ATGCGATCAA GCACATTGTG GAAAACGGTT CCGAGCAGGA AGGCACGCGC GAAACCCATG CCTGGATGCG CTCATTCGAG GCAGCCATCG AGCGCATCAA ACCGGGAGCG TTCACCGTTG GCGAAGTCTT CGGCGGGCGC GCCGGGTCGC TTGACGCCTA CTATCCCGAC CAACTCGACA CCTACTTCGA GTTTGGCGTT GCGGAAGGAA TCTTGCGCTC CGCCAACACC GGCGCGCCGG GACCGTACCT GACGGCGGTG GAAGATGCGC TCATGCGCCT TCCATACCAG CGTTGGGCGC CATTCCTCAC CAACCACGAC CAGGAACGGG TGATGACTGT CCTCGGCGGC GATGCCGGTA AGGCGCGCGT TGCGGCCATC GCCCTGCTGA CTCTTCCGGG TCTCCCATTT GTGTACTATG GCGAGGAGAT CGGTATGACG GGCGCCAAAC CGGATGAGCG CATCCGCACG CCGATGCAGT GGACGGGCGA ACCGGGAGCG GGATTCACGA CCGGTTCGCC CTGGCAGGCG CCCCAGAGCG ATTTCCCGAC GGTCAACGTC GCCGCGCAGG ATACCGATCC CGACTCGCTG CTCAACGTCT ATCGGACCCT CATCCGGTTG CACACCACCC GTCCGGCGCT CGGCAAGGGC GATTTCACCG CGCTGAACGC GACCGGCGGC GCAGCGGCGT TCCTGCGGCG CAGCGGCGAC GACGCGGCGC TGGTGGTGAT CAACTTCAGC GCCAACCCGC TCTCCGGCGT GACCCTCTCG ACTGCGCAGA GCAATCTCGA TCCCGGCGCC TACACCCCTG AACTGCTCTT CGGCAGTGGA AACCTGTCGC CGCTCATCGT CGGCGCCAAC GGGTCAATCA GCGCGTATGC GCTGCCTGAG ATTCCGCCGC AGCGCGCGGT GATTGTTGGG TTGGGGAGAT GA
|
Protein sequence | MKARALFLII VLIIVGCGAP SAAVPPTATP PAAPSATPLS EEAALLATMA ANAPSPAEAT PTVPLPTLFP TITLGPTAEP RPLPAGWWDT AVCYEIFVRS FYDSDGDGIG DINGLIAKLD YINDGDPTGG SDLGANCIWL MPVAEAASYH GYDVIDYNAI EKDYGTNDDF KRLVAEANRR GIRVIVDLVL NHTSSAHPWF ISALNDPASP YRDWYIWSPV DPGYRGPWGQ QVWHRSPARN EYYYGVFVAE MPDLNYRNPE VVAEAEKIAA FWLNEMGVDG FRLDAIKHIV ENGSEQEGTR ETHAWMRSFE AAIERIKPGA FTVGEVFGGR AGSLDAYYPD QLDTYFEFGV AEGILRSANT GAPGPYLTAV EDALMRLPYQ RWAPFLTNHD QERVMTVLGG DAGKARVAAI ALLTLPGLPF VYYGEEIGMT GAKPDERIRT PMQWTGEPGA GFTTGSPWQA PQSDFPTVNV AAQDTDPDSL LNVYRTLIRL HTTRPALGKG DFTALNATGG AAAFLRRSGD DAALVVINFS ANPLSGVTLS TAQSNLDPGA YTPELLFGSG NLSPLIVGAN GSISAYALPE IPPQRAVIVG LGR
|
| |