Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rcas_3566 |
Symbol | |
ID | 5541067 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | + |
Start bp | 4655754 |
End bp | 4657118 |
Gene Length | 1365 bp |
Protein Length | 454 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 640895685 |
Product | extracellular solute-binding protein |
Protein accession | YP_001433633 |
Protein GI | 156743504 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.42847 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.297516 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGAACAC GCACCCGTCT GAGTCGACGA CAGTTTCTGC GCAGTGCAGC CGTGGGAGGA GCAGCGCTTG CCTCCGGCAT TCTGGCGGCT TGCGGCTCGT CGCCAACAGC GCCCACAACC GGTAACCCGA CAGCAGCAGC GCCGACGCAG GTCTCCAGCG AACCAACGAA GATTCGTGCG CTTATGTGGA GTAATGGACC GGTCATCGAC GAGAACTTCC GAGTCCGCGC GCAGATGTTC AACGAAGCGT TCAAGGGGCA GTACGATCTC GATCTGCAAC TTCTGCCCTA CGACCAGTAC TGGCCCCGCA TCGACCTGGC ATATGGCTCG AAGAACCCAT ACGACCTCTA CTTCTTCGAC GTGCAGGCAT ACGGACACTA CCGGGCAGGG TTGCTCGCCA ATATCCAGCC GTATGTCGAT CTGGCGCCGG AACTGATGAA CGCCGAGGAG TATCCGGTGG CACTGTACGA TGCCTGGCGC TTCGACGGCA GCAATCTCTA CGGCTTGCCG GAAAATATCC AGGTGCTGGC GCTCTACTAC AACCGTGATC TTTTCGATGC CGAGGGGCTG GCATACCCCG ACGAGACCTG GACGTGGGAC GATGTGATCA ATGCCGCCAC GAAACTGACG AAGCGCAGCG GCGATGAGAC CACGCAGTGG GGGATGGATG TCGGCGTGAT GGATATCTGG TGGGGCGCGC AGACGCTGGC GTGGGCGATG GGTGGCGGTT TTTTCGATAA GATCGTCGAG CCGACGAAGT TTCAGGTCAG CGATCCGGTC AATGTGCAGG CGCTCACCTT TCTCCGCGAC CTGATCTTCG AGTATAAAGT CGCTCCCACC AAAACCCAGC GTTCCGCGAC AGCACAGGAT ATTGGCATTT TCCAGACCGG CAAGGTGGCG ATGTTCTTCG ATGGCAGCTG GGCGATCAGT GGTTTCCAGG ATGTGCCGTT CAAGTGGGAT ATGGCGCCGT TGCCCATGTG GAAGGATAAG CGCGTCTCCG CTTACTGGCT TGGCGGGCAG GTCATTCCGA AAGACTCGAA GGTCATCGAC GCCGCCTTCG CCTTTTCGCG CTGGTCGGCA ACAACGTATC AGAAGACGAT GGCGTCCAAC CACGACTGGA TACCAATCGC GCGTTCGGCG CGCGAGTCCG AGGAGATGTA TGTCGGGCAA CCAGCCGGGC TGCGCAAAGT GCTCGGCACT ATCGAAGGTG CACGACTTGG TGATTTTTAC TCACGCAACA ATCAGCAGAT CTTCGGCGAG GTGCTGCTGC CGACGTTCGA TCAGTTGTTC CTCGGCAACC TGACGCCGGA AGAGGCGGCA AAGAAGATCG ATGAAGAAGC CAATGCGCTT CTTGCGAAAG GATGA
|
Protein sequence | MGTRTRLSRR QFLRSAAVGG AALASGILAA CGSSPTAPTT GNPTAAAPTQ VSSEPTKIRA LMWSNGPVID ENFRVRAQMF NEAFKGQYDL DLQLLPYDQY WPRIDLAYGS KNPYDLYFFD VQAYGHYRAG LLANIQPYVD LAPELMNAEE YPVALYDAWR FDGSNLYGLP ENIQVLALYY NRDLFDAEGL AYPDETWTWD DVINAATKLT KRSGDETTQW GMDVGVMDIW WGAQTLAWAM GGGFFDKIVE PTKFQVSDPV NVQALTFLRD LIFEYKVAPT KTQRSATAQD IGIFQTGKVA MFFDGSWAIS GFQDVPFKWD MAPLPMWKDK RVSAYWLGGQ VIPKDSKVID AAFAFSRWSA TTYQKTMASN HDWIPIARSA RESEEMYVGQ PAGLRKVLGT IEGARLGDFY SRNNQQIFGE VLLPTFDQLF LGNLTPEEAA KKIDEEANAL LAKG
|
| |