Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RoseRS_1420 |
Symbol | |
ID | 5208372 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus sp. RS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009523 |
Strand | + |
Start bp | 1729990 |
End bp | 1731237 |
Gene Length | 1248 bp |
Protein Length | 415 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640595031 |
Product | extracellular solute-binding protein |
Protein accession | YP_001275770 |
Protein GI | 148655565 |
COG category | [E] Amino acid transport and metabolism [T] Signal transduction mechanisms |
COG ID | [COG0834] ABC-type amino acid transport/signal transduction systems, periplasmic component/domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0000217844 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.0146408 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCACACCC GCTTTTCGCC CGCTGCTCGA GTTCCACCGA TAGCCCTGAT AGTGCTGGCG TTCGTCCTTG CCGGGTGTGG CAACCTCGGC GGACTGCTCG GCGGACAACC GACCCCTGAA CCGATCATTC TGATCGCAAC GGCGACTCCC GCACCGGTGT CTCAGGCGAC CGCCACGCTG ACCGTTGCAC CATCGCCGAC CAGCGCGCCC TCCGACGATG CTACCGCCAC CCCTGCACCG CCAACTGCGG AACCGCCCAC TCCCCCGCCG GCGCCGCAAA AGATCCTGGC GCGCGTCAAG GAGCGCGGCT ACCTGATCTG TGGAACGAAC GCCGACCTGC CTGGGTTTGG CTTCTACGAT AGCGTGCGCC AGACCTGGAG CGGCTTCGAT GTCGATTTCT GCCGCGCTGT CGCTGCCGCG ATCTTCGGGG ATGCCACAAA AGTAGAGTTC GTCGCCCTCG GCACCGGACC AGGACCGAAC AACCGGTTCG ATGCCGTGCG CGAGGGACGG GTCGACGTCC TGTTCCGCAA TACCACCTGG ACATTGGGAC GGAACATCAG CGGTCTGGCG TTTGGTCCCA CGACCTTTCA CGACGGTCAG ACCTTCATGG TACGCGCCAA AGACCGGATC ACGAAACTTG AAGATCTCGA AGGCAAGGTG ATCTGTGTTG CAAAAGGCAC CACCAGCGAG CAGAACCTGA ACGACGACTT CGCCGCGCGC GGCATCAGGT TCACTGCCCG CGTGCTTGAT GGCGAAGATG AACTCTACCC CGCCTACGAC GAAGGCGAAT GCGATGCGGT GACCAGCGAC AGTTCCCAAC TGGCTGCCAA ACGTCAGCAA CTCAAGAATC CTGCCGACCA CATCATTCTC GGCGACCGCA TCTCGCGCGA GCCGCTCGGT CCCGTCATCG CCCGCGACGA CAACCAGTGG CTCGACGTGA TCAGCTGGAC GGTCTTTGCG ACGATCTATG CAGAGGAGTT GCGTGTCGAC CAGCGCAACG TTGATCGGTT GCGCACCAGC ACAACCGATC CGCGCATCAA ACGGTTGCTG GGGTTGGAAG GAAACTTTGG CGAGGGATTG GGGCTACCGA ACGACTTCGC CTACCAGATC ATCAAGCAGG TCGGCAACTA CGGCGATATT TACAACCGCA ACCTGGGACC AAACACTGTT ATCAACCTTG ACCGCGGACC GAACAAGGTC TGGAATCTCG GCGCCGGCGG CGTGCTGGCG TCCCCGCCGT TCCGCTGA
|
Protein sequence | MHTRFSPAAR VPPIALIVLA FVLAGCGNLG GLLGGQPTPE PIILIATATP APVSQATATL TVAPSPTSAP SDDATATPAP PTAEPPTPPP APQKILARVK ERGYLICGTN ADLPGFGFYD SVRQTWSGFD VDFCRAVAAA IFGDATKVEF VALGTGPGPN NRFDAVREGR VDVLFRNTTW TLGRNISGLA FGPTTFHDGQ TFMVRAKDRI TKLEDLEGKV ICVAKGTTSE QNLNDDFAAR GIRFTARVLD GEDELYPAYD EGECDAVTSD SSQLAAKRQQ LKNPADHIIL GDRISREPLG PVIARDDNQW LDVISWTVFA TIYAEELRVD QRNVDRLRTS TTDPRIKRLL GLEGNFGEGL GLPNDFAYQI IKQVGNYGDI YNRNLGPNTV INLDRGPNKV WNLGAGGVLA SPPFR
|
| |