Gene Information |
Locus tag | Rcas_4429 |
Symbol | |
ID | 5541942 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | - |
Start bp | 5697498 |
End bp | 5698790 |
Gene Length | 1293 bp |
Protein Length | 430 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640896527 |
Product | extracellular solute-binding protein |
Protein accession | YP_001434463 |
Protein GI | 156744334 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2182] Maltose-binding periplasmic proteins/domains |
TIGRFAM ID | |
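
The coordinate and length fields in this table can be cross-checked with simple arithmetic. The following is a minimal Python sketch, not part of the record itself: the variable names are illustrative, and it assumes that the coordinates are 1-based and inclusive and that the gene length includes one terminal stop codon.

```python
# Values copied from the Gene Information table.
start_bp = 5697498
end_bp = 5698790
gene_length_bp = 1293
protein_length_aa = 430

# Assumption: coordinates are 1-based and inclusive,
# so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS includes one terminal stop codon,
# which is not counted in the protein length.
codons = span_bp // 3
expected_protein_aa = codons - 1

print(span_bp, span_bp % 3, expected_protein_aa)  # prints: 1293 0 430
```

Under these assumptions the span matches the listed gene length, is a multiple of three, and yields the listed protein length.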
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGATGCC GTCTGGTTGC CATTCTACTG GTTGCTGCGC TGCTCGCCGC ATGTGAGTCG GGCAGCACGC CCGAAACCTC TTCCGCTCCG ACGACTGTTG CACAAGATTC CAGCGTTGGA CCACTCCTCC TCTGGCATGG ATGGTCGGGC GGTGATCGGC AGGCGCTGGG GCGCCTGGTG GATCGCTATA ACCGTCAGCA GCGCGACGGG CGTATTGTGC TGCAATCAGT GCCGCTGGCA GGCTTCGCCG CCGAACTGCG CGCAGCCGTG GCAGCAGGCA GTGGTCCTCA TCTGATTCTG ATTCCAAATA CCTGGATCGG TGGGCTGGCG GAGGCGGGGG TGTTGCTGCC GCTCAACGAT CTGGTGCCGG CGCAGGAGAC CGGTACGCTT CTGCCGGTGA CGCTGGCGGG AGCGCAGGCG CGCGATGCCG CCGGAACACT GCGGTTGTAC GGTGCACCGG TACGCTTCGA TACGCTGGCG CTCTACTACA ATGCTGCCAA TCTCACCGAG CCCCCCGCCG ATACCGCAAC CATGCTCGCT GTTGGACGCG GCTTGAGCGA CCCGGAAGCC CAACCACCGA TCTGGGGACT GGCGCTCAAC CTGTCGTATG ACAATATGAT CGGGTATCTC TACGCCTTTG ACGGGCGGAT ATTCGATGAC AACGGGCAGG TTGCGCTCGG TACAGCCGGT CGTGCTGGCG CAGAACAATG GCTCGCCTGG TTGATCGCGC TGCAAAATGA TCCGCGCATT CTGGCGCGGA GCGAGAGTAG CATCCTGGTC GATCGTGAAT TGAAAGATGG GCGCGCCTTT ATGACGTTTG ATTGGGCGCA TCAGATCGGT GTCTATCGTG GTCTGTGGGG CAATCAGATC GGCATTGCGC CGTTGCCACG TCTGAGTGAA ACGGGACGGG CGCCACGTCC ATATGTGCGC GCAGATGTCC TGGCGATCAA TAATCTTGCC GGGGTACGTG AGCGCGAGGC GGCTGCACGG TTTATCCGTT TCATGATCAG CGAAGAAGCG CAGGCTGTTC TGCTGCAAAG TGATATGCAA CCGGCATCGC GCACACTGGC GCTGACCGGC GATTCGCCAC AGGAGATCGC CGCACAGGTG TTTCGCGTCC AGGCGGAACA GGGGCTTCCC ATGCCCAACT CGAGTGTGCG CGCCTTTGTG GAGCAGGAAA TCAAACGCAT GCAACGCCAG GCGTCGCTCG GTCTCACCAC ACCATCCGAT GCAGTTACTG AGGCCGACCG CCGGCTGCGC GAACGATTGG AACCTTCTGC GCCAATGCCT TAA
|
Protein sequence | MRCRLVAILL VAALLAACES GSTPETSSAP TTVAQDSSVG PLLLWHGWSG GDRQALGRLV DRYNRQQRDG RIVLQSVPLA GFAAELRAAV AAGSGPHLIL IPNTWIGGLA EAGVLLPLND LVPAQETGTL LPVTLAGAQA RDAAGTLRLY GAPVRFDTLA LYYNAANLTE PPADTATMLA VGRGLSDPEA QPPIWGLALN LSYDNMIGYL YAFDGRIFDD NGQVALGTAG RAGAEQWLAW LIALQNDPRI LARSESSILV DRELKDGRAF MTFDWAHQIG VYRGLWGNQI GIAPLPRLSE TGRAPRPYVR ADVLAINNLA GVREREAAAR FIRFMISEEA QAVLLQSDMQ PASRTLALTG DSPQEIAAQV FRVQAEQGLP MPNSSVRAFV EQEIKRMQRQ ASLGLTTPSD AVTEADRRLR ERLEPSAPMP
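
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. Translation table 11 assigns the same amino acid to each codon as the standard genetic code (it differs in the start codons it permits), so a standard codon table is used here. The function name `translate` and the constant names are hypothetical, and the sketch assumes that the gene sequence is shown in coding orientation, which is consistent with it starting with ATG and ending with TAA even though the gene lies on the minus strand.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G; third base varies fastest).
# Assumption: table 11 shares these codon-to-amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Spaces (as used in the 10-base display blocks) are ignored.
    """
    dna = dna.replace(" ", "").upper()
    protein = []
    # Walk the sequence codon by codon, ignoring any trailing partial codon.
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence.
first_30 = "ATGCGATGCC GTCTGGTTGC CATTCTACTG"
print(translate(first_30))  # prints: MRCRLVAILL

# Last nine bases of the gene sequence, ending in the TAA stop codon.
print(translate("ATGCCTTAA"))  # prints: MP
```

Both outputs agree with the start and end of the protein sequence. Applying `translate` to the full gene sequence should, under the same assumptions, reproduce the 430-residue protein.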