Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rcas_0113 |
Symbol | |
ID | 5537573 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | - |
Start bp | 135103 |
End bp | 136470 |
Gene Length | 1368 bp |
Protein Length | 455 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640892278 |
Product | extracellular solute-binding protein |
Protein accession | YP_001430267 |
Protein GI | 156740138 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.270734 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAAAGA TCAACCGACG ACAATTCCTG CGTGGTGTTG CAGCAGGCGC CGGTGCGCTG ACACTGGCGG CTTGCGGCGG CGCAGCGACT ACATCGCCGA CCGGTGAGCC TGCTGCCCCA CCGGCGACTG CCGCGCCGCC AACCGCGATG CCTCAGGGAT CAATGGGATC AACCGTCGAA ATCACATACT GGGGATCGTT CAGCGGCGTT CTGGGTGAAG CCGAACAGGC GACCGTCGAG ACATTCAACA GCATGCAACA GGATGTCAGG GTCAACTACC AGTTCCAGGG CAATTACGAA GAGACTGCGC AGAAACTGAC CGCTGCGGTT CAGGCGCGGC AGACCCCCGA CGTCAGCCTG CTCTCGGATG TCTGGTGGTT CTCGTTCTAC ATCAACGGTC AGTTGCAGCC GCTCGACGAC CTGATGGCAG CCGAAGGGGT CAAGCGCGAA GCGTATGTTG ATGTGCTGCT CAACGAAGGT ATTCGCAAAA ATACGGTGTA CTGGATTCCG TTCGCGCGCT CGACGCCGCT GTTCTACTAC AACAAGGACG CCTGGGCGGA AGCCGGTCTC GACGATCGCG CGCCGAAGAC GTGGGAGGAG TTTATGGAGT GGGCGCCGAA ACTCAACAGA GAAGGGCGTT CGGCTTTCGC CCACCCTGGC GCCGCCAGCT ACATCGCCTG GCTCTTCCAG GGGGTGATCT GGCAATACGG CGGTCGCTAC AGCGACCCCG ACTTTACCAT CCGCATCCAC GAAGAAAATG GCATCAGAGC GGGCAACTTC TACCGCGATA CGACCCAAAC CTACAAGTGG GCGACGACGC CGAAGGATGT AACCCAGGAC TTTGTCACCG GGCTGTCAGC CAGCGCGATG CTGAGCACCG GCGCGCTGGC AGGCGTTGAG AAGAACGCGC AGTTCCCGGT TGGCACCGGT TTTCTGCCGG AGGGTCCGTC TGGCTTCGGA TGCTGCACCG GTGGCGCAGG CATGGCAGTT CTGTCCGGAT TGCCCGCCGA GAAGCAGCAG GCTGCGATGA AATGGATCGC CTTTGCCACC GGTGATGAGT GGGCAGTAGA CTGGTCTCAG CGCACAGGGT ACATGCCGGT GCGCAAGGCG GCAGTCGAGT CGGAGCGCAT GAAGCAATAC CTCGCCGAAC GACCGAACTT CCGCACCGCC GTCGAGCAAC TGCCGAAGAC ACGTCCGCAA GACTCGGCGC GCGTCTACGT TCGCGGCGGC GACCAGATCA TTGGCAAGGG GCTGGAGCGC ATCACGATTG CCGGCGAAGA CCCGGCGAAG GTCTGGATGG ATGTCAAGGC GGAACTTGAA GAGACCGCGA AGCCAACCGT CGAACTGCTG AAGACGGTTG AGGGTTAG
|
Protein sequence | MAKINRRQFL RGVAAGAGAL TLAACGGAAT TSPTGEPAAP PATAAPPTAM PQGSMGSTVE ITYWGSFSGV LGEAEQATVE TFNSMQQDVR VNYQFQGNYE ETAQKLTAAV QARQTPDVSL LSDVWWFSFY INGQLQPLDD LMAAEGVKRE AYVDVLLNEG IRKNTVYWIP FARSTPLFYY NKDAWAEAGL DDRAPKTWEE FMEWAPKLNR EGRSAFAHPG AASYIAWLFQ GVIWQYGGRY SDPDFTIRIH EENGIRAGNF YRDTTQTYKW ATTPKDVTQD FVTGLSASAM LSTGALAGVE KNAQFPVGTG FLPEGPSGFG CCTGGAGMAV LSGLPAEKQQ AAMKWIAFAT GDEWAVDWSQ RTGYMPVRKA AVESERMKQY LAERPNFRTA VEQLPKTRPQ DSARVYVRGG DQIIGKGLER ITIAGEDPAK VWMDVKAELE ETAKPTVELL KTVEG
|
| |