Gene Information |
Locus tag | RoseRS_1803 |
Symbol | |
ID | 5208762 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus sp. RS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009523 |
Strand | - |
Start bp | 2226671 |
End bp | 2227537 |
Gene Length | 867 bp |
Protein Length | 288 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 640595411 |
Product | extracellular solute-binding protein |
Protein accession | YP_001276143 |
Protein GI | 148655938 |
COG category | [E] Amino acid transport and metabolism; [T] Signal transduction mechanisms |
COG ID | [COG0834] ABC-type amino acid transport/signal transduction systems, periplasmic component/domain |
TIGRFAM ID | |
| |
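The coordinate and length fields in the Gene Information rows above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names `gene_length` and `protein_length` are hypothetical and are not part of the database.

```python
# Values copied from the record above (Start bp, End bp).
START_BP = 2226671
END_BP = 2227537


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a 1-based, inclusive coordinate range (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 867, matching "Gene Length"
print(protein_length(length_bp))  # 288, matching "Protein Length"
```

Under these assumptions the computed values agree with the Gene Length (867 bp) and Protein Length (288 aa) fields.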
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.363702 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.037594 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGAAGC AGACGCTGAT CATGCTTGCG TTCATTGCCG CTTCACTGGT TATGAGCGCA TGCGGTGGAG CACCCGCCGC ACAACCGACC CAACCCGCTG CACAACCGAC CCAACCCACA GCACAGCCAA CCCAACCCGC CGCTCAGACC GATGGAAAAC TGGCGCAGAT CCGGGCAGCT GGCAAACTCA TCGTCGGCAC CTCAGCAGAT TACCCGCCCT ACGAGTCGAT CGACGAAAAT GGCAACTTCG TCGGTTTCGA CATGGACCTC ATTCGCGCCG TCGGCGAGAA ACTCGGCGTT GAGGTCGAGA TCCGCGATAT GCCGTTCGAT TCGCTGATTG CGTCGCTCCA GGAAGGGAAG ATCGATGCCG TCATCGCCGC AATGCAGGCG ACTGCCGAAC GTGAAGAAAA GGTCGATTTC ACTATTCCGT ACCGCATGAC GAAGGATGCG TTCATCGGCG CCGGTAACAC GACAATTACC CTGAACAAGC CGGAGGACGC CGCCGGTCTG ACGATTGGCG CGCAGACCGG TACAGTGCAG GAGGGTTGGA TTCAGAAGAA TCTGGTCGCT ACGGGGTTGA CTCCGGCTGA TAAGGTGTTC AGTTATGAGC GCGCTGATCA GGCGGCGCTC GACCTCGCCA GCGGACGCCT TCAACTGGTG TTGATGGATG CCGAGCCTGC GCTGGAACTG GCGAAACAGA ACGGCTTGAA AGTCCTGCTC ATCACCGAAG AGACTGCCGA AGGCGGGAAG AGCATCGCCA TCCCCGAAGG CGCCAGCGAT CTCAAGGCGG AACTTGATCG GATCATCCAG CAGTTGATCG ACGACGGTAC GGTCAAGCAG TTGCAAGATA AGCACGGCTT GCCGTGA
|
Protein sequence | MQKQTLIMLA FIAASLVMSA CGGAPAAQPT QPAAQPTQPT AQPTQPAAQT DGKLAQIRAA GKLIVGTSAD YPPYESIDEN GNFVGFDMDL IRAVGEKLGV EVEIRDMPFD SLIASLQEGK IDAVIAAMQA TAEREEKVDF TIPYRMTKDA FIGAGNTTIT LNKPEDAAGL TIGAQTGTVQ EGWIQKNLVA TGLTPADKVF SYERADQAAL DLASGRLQLV LMDAEPALEL AKQNGLKVLL ITEETAEGGK SIAIPEGASD LKAELDRIIQ QLIDDGTVKQ LQDKHGLP
|
| |
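The sequences above are displayed in space-separated blocks of ten characters. The following is a minimal sketch of how such a field might be normalized and summarized before further analysis; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example string is only the first two blocks of the gene sequence, not the full record.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(blocked.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (0.0 for an empty string)."""
    if not dna:
        return 0.0
    return (dna.count("G") + dna.count("C")) / len(dna)


# First two display blocks of the gene sequence shown above.
excerpt = clean_sequence("ATGCAGAAGC AGACGCTGAT")
print(len(excerpt))
print(gc_fraction(excerpt))
```

Applying `gc_fraction` to the full cleaned gene sequence should give a value comparable to the GC content field, allowing for rounding in the record.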