Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rcas_3612 |
Symbol | |
ID | 5541114 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | - |
Start bp | 4722552 |
End bp | 4723787 |
Gene Length | 1236 bp |
Protein Length | 411 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640895732 |
Product | extracellular solute-binding protein |
Protein accession | YP_001433679 |
Protein GI | 156743550 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2182] Maltose-binding periplasmic proteins/domains |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.423524 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGCCAA CGCCGTTGCC TGAGCCTTCC GCAACACCAA CCGCTACAGC GACACCGCTG CCTTCGCCAA CGACTGCGCC GACCGCGCCG CCGCCTCCCA CGCTCACGCC TGCGCCGGAA CCGTTGACGG TGTGGGCAGC AGGTGACGAA GCGCGACGTG ACGCCCTCGC GCGGCTGCTC AGCGAAGCCG CCGCCGCAGC CAACGTGCCG ATCCGCATCC GTAGCAGCAC TCCCGACGCC ATGATCGCCC GTCTGCGCGT CGATCAGATC GATGGACGGC CACCACCGGA TGTCATCTGG GGTGATGGCA ACGATCTGGC GATCCTGCGC ACCATGGGGC TGATTCAGGC AGTGCGCTCG ACGCCTGCCG CCGAGGGTAC GCTGCCGGCT GTCGTTGTGG GCGCAACCGC CGACGATCAG CAGTGGGGCG TGCCGGTTGG CGCACAGGGT TTCTTGCTGC TGCTCTACAA CCGGAAACTT GTCGAATATC CGCCGCGCAC CATCGACGCA CTGATCGCGG CTGCGCGCAC CAACACCGGC GGAGGACGGT TCGGACTGGT TGCCGGGTGG ACTGAAGCGC GCTGGTTTGC GCTCTGGCTT GACATAACAG GAGGCGCCAT GCTCGATGCC GACGGAATGC CCGCCCTCGA CGCTCCTGCT GTTGTGTCTG CGCTCGAACT GCTGCGCACC CTCCGGCGGT ATGGACCAAC GCCCCCCTCG ACCTACGATG AGGGCGCGCG ACTCTTCCGG CGCGGCAGGG TGGCGCTGGC AATCGATGGC GATTGGGCGC TGGAAAGTTA TCGGGGATTG ACCGACACCC TGGAGTTAGG CATCGCGCCG CTGCCGCTGG CGAATCGGGG CGTGCCGGCA GCCGCGCCGC TTACCGGCGT GTACTTGATG TACGGCGCTG CGCTCGACGG TCAACGCCTG GCACAGGCAG AAACGCTGGC AGCGACCCTC CGCGAACCGG TATGGCAGGC GCGCATCGCG CGCGACCTGA GCATGCTCCC GGCGTCCATT CCCGCCTTGA ACGACCCGGC GGTCACTGAC GATCCGGCGC TGGCGGCAGC TGCAACGTAT GCCGGCAATG CGCCGGGCAT CCCGCCGACC CGTCCGATTC GCTGCGCATG GGACGCCATC GAAGTCGAAC TGCCGCCATT CCTGCTCGGC AGGCGCACCG CCGCCGAAAC CGCCACCGCC ATGCAACGGC GGGCGATGGC GTGTGTGGAG CGGTAG
|
Protein sequence | MAPTPLPEPS ATPTATATPL PSPTTAPTAP PPPTLTPAPE PLTVWAAGDE ARRDALARLL SEAAAAANVP IRIRSSTPDA MIARLRVDQI DGRPPPDVIW GDGNDLAILR TMGLIQAVRS TPAAEGTLPA VVVGATADDQ QWGVPVGAQG FLLLLYNRKL VEYPPRTIDA LIAAARTNTG GGRFGLVAGW TEARWFALWL DITGGAMLDA DGMPALDAPA VVSALELLRT LRRYGPTPPS TYDEGARLFR RGRVALAIDG DWALESYRGL TDTLELGIAP LPLANRGVPA AAPLTGVYLM YGAALDGQRL AQAETLAATL REPVWQARIA RDLSMLPASI PALNDPAVTD DPALAAAATY AGNAPGIPPT RPIRCAWDAI EVELPPFLLG RRTAAETATA MQRRAMACVE R
|
| |