Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RoseRS_0725 |
Symbol | |
ID | 5207664 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus sp. RS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009523 |
Strand | + |
Start bp | 892657 |
End bp | 894327 |
Gene Length | 1671 bp |
Protein Length | 556 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640594339 |
Product | extracellular solute-binding protein |
Protein accession | YP_001275091 |
Protein GI | 148654886 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCACGTC GCATCCGCTG GCAAATCCTG ATTGCGTCGA TCAGTGCTCT GACGGTGCTG CTCCTGATGA GTTACCTGGC GCTCACACGC GCATCGGTGG CGCGCCCGCT GGCAGGAGGC GATTATATCG AAGGGGTTGT CGGCGCGCCG GTGCATCTCA ATCCCCTGGT CGCTGATCCA GCCTCCGATC CGGTCGCCGC CGACATCCAG CGCCTGGTCT TCGAAGGGTT GACCCGCCCC GGTCCCGACG GTCTACCCAT GCCGGCACTC GCCGAGTCGT GGGCGGTTGA TGAGAGCGGG ACAGTCTACA CCTTTACACT GCGAAGCGGC GCGTCGTGGC ACGATGGTGC GCCGGTGACG GTCGATGACG TGCTGTTCAC GCTGCGCGCT GTGCAGGGTC CGGCGTTCGC CGGTGATCAG AACGTTGCCG CTTTCTGGCG TACTGTCCTG GTTGATCGCG TCGGAGAACG CAGCGTCAGT TTCCGCTTGG AAGCGCCATT TGCGCCATTT CTGCGGTTGA CCGGTTTCCC GATCCTCCCC GCGCACCTGC TGCGTAACGT TCCGCCGGAA CAGTGGGAGG CGCATCCATT CAACCGCTTG CCGGTCGGCG CCGGTCCGTA CCGCCTGGTG GAACTCGATG AACAGCGTGC GCTGCTACGC GCCAACCCGC GTTATTTCGG CGCAACCCCG TTCATCGAAA CGATTGAACT GCGCTTTTTT CGCACTGAAC AGGAAGCATT CGCTGCGCTG ACCCGCAGCG AGATTCAGGG TCTGGCATTC ACCGGCGCCA GCGCCCTGGC GGATGTCAAC CTGCCACGCG GCATTGTGCG ACGTCAGGCG CTGCTGGATG GATACACGGC GCTCTCCTTC AACCTGCGCG ACGGTCTGCT CACCGATCTC GGCGTGCGAC GCGCACTGGC GACTGCACTC GACAAGGATG CCCTGATCGC CAGTGCGCTT GCGGGGAAGG TGATGCGGCT CGATACGCCG ATTCTGCATG GATGGTGGGC GGAAACGTCC GATGTGTCGT GGTATGAGCC AGACGTTGCC CGCGCAATGG CGCAACTTGA CACGCTGGGG TACGTCCCGG GCGCCGATGG CGTCCGTGTT CGGGACGGTC AACCACTCGT CTTTTCGCTG CTGACCGATA ACTCTCCAAC GCGGCGTGCG GTTGCAGAAG AGATTGCCCG TCAGTGGAGC GCCATCGGGG TGCAGATCGT TATCGAACCG GTCGAACCGA CCGAAATGCA ACGCCGGCTG GAAACGCATG AATTCACCAT CGCACTGCAC GGATGGCAGC GGCTCGGTTC CGATCCCGAT GTCTTCGAAC TCTGGCATTC GAGTCAGGCG GAGCGCGGAC GCAATTACGC AGGTCTCGCA GATGCCACTA TCGACGAGAT TCTCTCCAGC GCGCGCAAAA TCTACGACAT CACGGATCGT GCCGAACTCT ACCGTGAATT TCAGGAACGC TGGGTCGAAC TGGCGCCGGG GATCATTCTC TATCAACCGA TCCTGTTCCA CGCCACAGTT GCCGACCTCG GCGACACGAT TGCCGTCCCG CCGGATGCCG CCGCTTCGCC CCATCTGCTG ATCGGGCGCG AGGGGCGTTT TGTGAATGTC AACCGCTGGT ATCTGCGTAG CGCTCGCGAG ATCCGCGGTG ATTTGCGATA A
|
Protein sequence | MARRIRWQIL IASISALTVL LLMSYLALTR ASVARPLAGG DYIEGVVGAP VHLNPLVADP ASDPVAADIQ RLVFEGLTRP GPDGLPMPAL AESWAVDESG TVYTFTLRSG ASWHDGAPVT VDDVLFTLRA VQGPAFAGDQ NVAAFWRTVL VDRVGERSVS FRLEAPFAPF LRLTGFPILP AHLLRNVPPE QWEAHPFNRL PVGAGPYRLV ELDEQRALLR ANPRYFGATP FIETIELRFF RTEQEAFAAL TRSEIQGLAF TGASALADVN LPRGIVRRQA LLDGYTALSF NLRDGLLTDL GVRRALATAL DKDALIASAL AGKVMRLDTP ILHGWWAETS DVSWYEPDVA RAMAQLDTLG YVPGADGVRV RDGQPLVFSL LTDNSPTRRA VAEEIARQWS AIGVQIVIEP VEPTEMQRRL ETHEFTIALH GWQRLGSDPD VFELWHSSQA ERGRNYAGLA DATIDEILSS ARKIYDITDR AELYREFQER WVELAPGIIL YQPILFHATV ADLGDTIAVP PDAAASPHLL IGREGRFVNV NRWYLRSARE IRGDLR
|
| |