Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RoseRS_4489 |
Symbol | |
ID | 5211474 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus sp. RS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009523 |
Strand | + |
Start bp | 5628318 |
End bp | 5629322 |
Gene Length | 1005 bp |
Protein Length | 334 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640598068 |
Product | periplasmic binding protein/LacI transcriptional regulator |
Protein accession | YP_001278771 |
Protein GI | 148658566 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1879] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.311346 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 2 |
Fosmid unclonability p-value | 0.000667621 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGCATGTCA GGATGGTTCA ACTGCGGCTG ATGCTGCTGG TGCTGGCGCT GCCCGTTATG CTTGCCGCCT GCACACAACC GACATCCTCG CCAGGCGGAG CCGGCGGCGA TGCTCCCCGC CGGTGGCGTA TCGGCATGTC GCAGGCGAAC AACGCCGAAC CGTGGCACCA GGCGATGAAT GCGCAGATCG CTGCGGCTGC CGCCCGTCAT CCCAACATTG AGATCGAGTT CACTGATGCG CGCCAGAACA ATGCACAGCA GATTGCAGAC GTGGAAGCCT TCCTGAGCAA GGGTATCGAT CTCCTCATCA TTTCGCCCAA CGAAGCCTCC CCGCTGACCC CGATTGTAGC GAGCGCATTT CAGCGCGGCA TTCCGGTGAT CGTGCTGGAT CGCAAAGTGA AGGGCGAACA GTACACGATG TGGATCGGCG CCGATAACCG CCTGATCGGG CGCAAGGCGG GCGAATATAC GGCGCGCTGG TGCCGCGAAC AGCAGCGATC ACCGTGTACG GTCATCGAAC TGCGTGGACT GGAAGGCTCG ACCCCCACGC AGGAGCGCGG CGACGGCTTC CGCGAAGGGA TCGCCGCCAA CCCGGATGTG CGCATTATTG CCAGTCAGAA CGCCGACTGG CTCGCCGAAC GGGCTGACGC GCTTGCGCGC GTTCTCTTTG AAGCAAACCC GGATGTCGAT GTGGTCTATG CCCACAACGA TCCTATGGGC ATTGCCGCCT TCAATGTTGC AAAGGAGCAG GGGCGCGATA CCGACGCCAT CCTCTTTATC GGCATCGATG CGCTTGCGAC CTCCGATGGC GGTATTCAGG CGGTGCGGCA GGGCAAACTC AATGTGACAT ACGTCTATCC TACGGGCGGC GCCGAAGCCA TCGAATGGGC GCTGCGAATA CTGGAACAGC GCGAGACGCC GCCGCGCGAA ATCATTCTCG ATACTGAAGA AGTCACCACT GCCAACGCCG ATGCCATGTT CCAGAAATAC GGAGGCCGGG AATGA
|
Protein sequence | MHVRMVQLRL MLLVLALPVM LAACTQPTSS PGGAGGDAPR RWRIGMSQAN NAEPWHQAMN AQIAAAAARH PNIEIEFTDA RQNNAQQIAD VEAFLSKGID LLIISPNEAS PLTPIVASAF QRGIPVIVLD RKVKGEQYTM WIGADNRLIG RKAGEYTARW CREQQRSPCT VIELRGLEGS TPTQERGDGF REGIAANPDV RIIASQNADW LAERADALAR VLFEANPDVD VVYAHNDPMG IAAFNVAKEQ GRDTDAILFI GIDALATSDG GIQAVRQGKL NVTYVYPTGG AEAIEWALRI LEQRETPPRE IILDTEEVTT ANADAMFQKY GGRE
|
| |