Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rcas_3152 |
Symbol | |
ID | 5540650 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | - |
Start bp | 4088059 |
End bp | 4090971 |
Gene Length | 2913 bp |
Protein Length | 970 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640895273 |
Product | extracellular solute-binding protein |
Protein accession | YP_001433224 |
Protein GI | 156743095 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGCTGC CAGACCGCGC ATGTCCCTCA CTCTGGTTCT CCTGGCGATG CACCGCTCGT ATCTTTTCGT CAGCGCTCGA CCAACTCTTG AACAGGCGTG GCTTCCGGCA ATTGTGGAAA GGCAGGCATA TGCAGCGTGT ATGGATACTT CTGACCTTCT GGGCGGTGCT GCTGGCATCG TGCGGCGCGC CGCCACCATC GGGTAATGCG ACTCCGGCAG TCCCCGGAGT AACGCCGACG AGCGGGAGCG AGACGGTCAC GATCTCGTTT GCGGTGTGGG AGTACGAGCG CAACATCTAT CAACCGCTGG CGGATCGCTT TATGACTGAG AACCCCAACA TCAAAATTGT GCTGGTGTCG CTCGATGACA TCATGATGTT CGAGCCGGGC AACGAGGGAC CGACCAATCC GCTCGATGTT CTACGGCGGA TCGTGTCAGC CGCCGATACT GCTCCCGCAT TCGCGCTGAC GCCGGAAGCG TTTGGCACAC CGCTGCTGCT CGACCTCAAA CCGTTGATGG ACGCGGACGC TGCGTTTCAG CGCGACGACT TCTTCCCCGG CGCCATCGAG CGCTACGCGG CGAAGGGCGG GGTCTGGGCG CTACCGCGCT ACCATGGCGT GCCGCTGATC GTCTACAATC GCGCCTTGTT TCAGAATGCC AATCTATCCG AACCACGCCC TGGCTGGACG TGGGATGATC TGATGTCCGC CGCCGAGCGT CTCACCGAAC GCAGCGGTCT GACGACCCTC ACCTACGGGT TTATGGAACC AACCAATGGT CTGCTGCCGC TCCTGGGGTT GCTCGAAAAA CAGGGGATCA ATCCACTAAC GGTCGTCGCG GAAGAGACGG ATCTCACTGC GCCGGAGTAT ATTGCAGCAG TCGAGCGCAT CCAGGAATGG TATCGCGCGG GTGTGCTGGT CGCACCCTAT GCGCGCGACG CCGCTTCTGA CGATCCCACG CGCCTGGCGC GCGAAGGGCG GGTCGCACTC TGGAGCGACA TGTCCTACAT CAGCAACGAC GATGGCTCGC CCTGGACGCC GGATTTTCCG GTCGGTCGTG CGCCATTCCC GAAGAATGCC ATTCTCGATC CCTTCTTCAG CTCAACCGAA GGGTTCATCA TCAGCGGCGG CACAGCGCAC CCCGACGCCG CCTGGCGCTG GATCGAGTGG CTCTCGCGCC AACCGGCGCT CGAGGACCAG CAAAGTTTCG GTCCATTGTC GCGTTTGCCT GCCCGCCAAT CGGTGGCGCA GCAGACGGAG TTCTGGAGCA AACTCGATCC GCCGACTGCT GAGGCGTACC GGTGGGCGAT AGAGAACAGC GGGACGCTGC CAAGCCAGCA GTTCGATTAT ACCGTGATGG GGGCGTTGAG TCAGGCAGTT GCGACGGTCA CCGGCGACCC AAAGGCGAAT GTGCGCCAGG CTCTGGCAGA AGCGCAGCGG CAGATGCAGG AAGCGATTGC GCAGCGGGAA CTCACGCCAA CGCCGAAGCC AAACCTCAAT CCGGTGGTAG TGGCTACGCC GGAGCCGCAG GAAGCGCCGG CCGGCGCCAC AACCGTGACG TTTGGCCTTT ACGGGTACAA TCCGACCGAA CTGCGCCGCG TCGCGCGCGC GTTTCGCGAA CAGCGTCCCG ACATCTTCGT GCAGTTGAAG CCGTTCGAGT GGACGCCCGA CTTGAAGGAG GTCACGGCTG CCACCCTGGC GCAGACCAGC GACTGTTTTT TCTGGTTCGG TCCGCTCGCC GGCAACGATG AAACGGCTCT GCTCGATCTG CGCCCATTGA TCGAGGCAGA CTCCACCTTC CCGCGCGATG ACATCGTCCC GGCGGCGCTG GCACAGTACA GCCGCGAGGG GCGCATCTAC GGGTTGCCCT ACGCGGTCAA TATGCGCAGC CTGATTTACA ACCATGCCGC ATTCGAGGCG GCGGGAGTGC AACCGCCGCG CGCCGACTGG AAACCGGACG ATTTTCTGGC GGCGGCGCAG GCGTTGACCA GAGGCGAAGG GAACGAGCAG CGGTGGGGGT ATGTGCCGCT GGGAGGTCCG CAAGCCGACC TGTTGTTCTT CATCAACCAA TTTGGCGCGC GCCTGATGGT TGGCGAGGGC AAGGACCTGC GCCCGAACTA CACCGATCCG AAGACTATCG CTGCCATCCG CTGGTACCTC GATTTGAGCA TGGTTCACAA GGTGGCGCCG CCGCTCACCA TCGTCTATCG GCGCGACGAT TTTGGCGGCG GGGATCGTTC CTACGAACTG GTCCAATCCG GTCGGGTCGG GATGTGGTTT GGGTATCCTG AATCATTCCA GGGAGGGGTC GTGATTCAAC CGCTCGAACC GGGTGGTGGA CCGGCGCCGA CTGCCGCGCC GCTGACCCCG GTCGAACGCG ACATTCGAGC TGCGCCACTG CCGGTCGGCG GCGCCGGTGT GCCGTCGAGC GAAGTGTTTT TGCGGGGCCT GTTCATCTCG GCGCGGACGC AGCAACCGCA GGCATGCTGG GAATGGATCA AGTTCCTTTC CGGCGACACG TCCCTGACGT ATGGCGATAT GCCCGCCCGC CGCTCGGTCG CACAATCTGA GGCATTCATC AAACAGTTGC CGCCGGAGCG AGTCGCGCAG TTCGAGGCGA TGCGGGCAAC CCTGGCGATG CCGTCGCAGA ACGCCGGTGA TGCTAACGTG ATCTACAGCC AGCACACCGA TCCCTACTGG TTCTTTAAGG CGCTTAGCGC CGTGGTGGAA AAGGGCGCAA GCCTCGAAAA GGAACTTGCC GAAGCGCAGC GCTTTGCAAC CGCCTTTGCC GAGTGTATGA ACCGTGAGGA TGCGCGTGCG CCGGCATGCG CAAAAGAAGT CGATCCAGAC TATAAAGGGT ACAATGTCGA GGAGGGCGAT CCGACGCCTC TTCCCGCCGG CGCTGCGCCG TAG
|
Protein sequence | MTLPDRACPS LWFSWRCTAR IFSSALDQLL NRRGFRQLWK GRHMQRVWIL LTFWAVLLAS CGAPPPSGNA TPAVPGVTPT SGSETVTISF AVWEYERNIY QPLADRFMTE NPNIKIVLVS LDDIMMFEPG NEGPTNPLDV LRRIVSAADT APAFALTPEA FGTPLLLDLK PLMDADAAFQ RDDFFPGAIE RYAAKGGVWA LPRYHGVPLI VYNRALFQNA NLSEPRPGWT WDDLMSAAER LTERSGLTTL TYGFMEPTNG LLPLLGLLEK QGINPLTVVA EETDLTAPEY IAAVERIQEW YRAGVLVAPY ARDAASDDPT RLAREGRVAL WSDMSYISND DGSPWTPDFP VGRAPFPKNA ILDPFFSSTE GFIISGGTAH PDAAWRWIEW LSRQPALEDQ QSFGPLSRLP ARQSVAQQTE FWSKLDPPTA EAYRWAIENS GTLPSQQFDY TVMGALSQAV ATVTGDPKAN VRQALAEAQR QMQEAIAQRE LTPTPKPNLN PVVVATPEPQ EAPAGATTVT FGLYGYNPTE LRRVARAFRE QRPDIFVQLK PFEWTPDLKE VTAATLAQTS DCFFWFGPLA GNDETALLDL RPLIEADSTF PRDDIVPAAL AQYSREGRIY GLPYAVNMRS LIYNHAAFEA AGVQPPRADW KPDDFLAAAQ ALTRGEGNEQ RWGYVPLGGP QADLLFFINQ FGARLMVGEG KDLRPNYTDP KTIAAIRWYL DLSMVHKVAP PLTIVYRRDD FGGGDRSYEL VQSGRVGMWF GYPESFQGGV VIQPLEPGGG PAPTAAPLTP VERDIRAAPL PVGGAGVPSS EVFLRGLFIS ARTQQPQACW EWIKFLSGDT SLTYGDMPAR RSVAQSEAFI KQLPPERVAQ FEAMRATLAM PSQNAGDANV IYSQHTDPYW FFKALSAVVE KGASLEKELA EAQRFATAFA ECMNREDARA PACAKEVDPD YKGYNVEEGD PTPLPAGAAP
|
| |