Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RoseRS_1093 |
Symbol | |
ID | 5208040 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus sp. RS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009523 |
Strand | + |
Start bp | 1359985 |
End bp | 1361874 |
Gene Length | 1890 bp |
Protein Length | 629 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 640594707 |
Product | extracellular solute-binding protein |
Protein accession | YP_001275451 |
Protein GI | 148655246 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0361636 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.00886374 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACACACG ACAGAAAAGC ACGACTGAAC CGACGCACGT TTCTGCGCGC CGCAGCTATC GGCATCGGGT CAGCGGCGCT TGCCGCCTGT GGCGGCGGCG GCGCAACCGC ACCAACCCAG CCGCCAGCGA CGCTTCCCCC GCCAACGACG GCGCCAACCC TTGCGCCGGT TCAACCAACG CCGCCGCCAA CCGCCGCCCC CGCACCAACT GCTGCACCGG CGGGCACCGT CACGAACTCG CTCGGCGTCA CCCTGCCGGA GAACGCCGCC CCGCTCGAAC ATCAGACCTT CGTCGTCTAC TTCGATATCA CCGCCGACTT CACCAGCCCT AACCAGATGG AAACGATCTA CAAGTCGGGA GGCTTCGGCA GCATCACCAA CCTGACCGGC GATACCCTCG TGCGCCTGAA CAAGGACTTC CAGGTGCAAC CGGCTGCCGC GCTCTCGTGG TCGTCCGATG AGACCGGCAA GGTCTGGACG TTCAACCTGG ACCCCAATCT GGTCTGGAGC GATGGCACAC CGGTGACGGC AGAGGACTTT GTGGCAACCT TCCGCTACGC TGCCGACCCG CAGCATGCCT GGGACTTCGC CTGGTACTAC AGTGCGCCGG GAGCGATCAA GAACTGGGAC AAGTGCGTTG CTGGCGAACT GCCGCTGGAA GAGCTTGGCG TCACAGCGAA AGATAAGCAC ACCCTGGTGA TCGAAACTGA AACGCCAGCG CCCTTCCTGC CCGCCAAACT GGTCTACAGC GAAGTGCTCA GCGCCGCGAA ACTGAAAGAG TACGGCTCCG GTCTCTACAC GGCAGACCCG GAGAAGACCA TTTCGTGCGG ACCGTACATT CTGAAAGAGT TCAAGCCGCG CGAACGGGTG GTGTTCGAGA TCAACCCGAC CTACAAAGGC ACGAACCGTC CGCGGATTGA GCGCGTCGTT CAGATTGCGG CACGCCCCGA AGCCATGTTC GCCGGATACC AGGCGGGCGA AGTTGACCGC GTGACCGGTG AGCAACTGCA AACAGCCGAC AACGAGATCA TCGCCAGAGA CCCCGAACTC TCGAAGCAGG TGCGTCTGAC CGCTGGCGAC TTCCGCACCG ATTACCTCTT CTTCGACTGC CAGAATCCGC CGTTCAACGA TGTGCGCGTG CGCCAGGCGT TCAGCCATAT CGTCGACCGC GATACGCTGA TCAAGACGAT CATCACGCCA ACACAGGGCA TCCCGGCATA CTCCTTCCTG ATGCCCGGTT TCCCGGCATC GAACTCAGAG GGGTTGAAGG ATATTCAGCG CTACGACCCT GAACTTGGGC GCGCACTGCT CAAGGAAGCT GGCTACGAAG GCGGCAAAGG GTTCCCCAAA TTGACCCTGT GGCTGCGCAA TGAGCCGCAG ATCCGCCAGG CGCTCGCCGC AGCAATTGCC GCAGCCATCA CCCAGGAGTA CGGCATCGAA GTCGAGGTCT CAAACAAGGA CTTCAAGACC TTCATGGATG CGCTCAATGC CAAGCCGACT CAGATCCAGT TCGGCATGGT GTCATATGGC ATCGACTTCC TCGACCCGTC GAACATGCTC GGCGTCTGGC TCAGCACGGG TCGCCACAAC TGGTTCAACA AGAAATTCGA CGAGATGGTG CTGTCGGCGT CGGAGAGCAC CGATCCAAAT CGTATCAAGG TGTTCCAGGA TGCGGAGCGA TTGCTGTGCG AAGAAGCGCC AGCGGTCTTC ATCTACCACC GTACCGTCGC CGACATCTAT AAGCCCTATG TCGTCGGTGA GTGCTTCGAG CCGAATATCG CCGGTTTCTC CGGTTTGCAG TGGCCCGGCT TCACATCAAT GAGCGACTCG TTGCAAACCA TGTATATGAG CAACGAGGTC ACGAAGTATC GGAAAGCGCC TCCGAGATAA
|
Protein sequence | MTHDRKARLN RRTFLRAAAI GIGSAALAAC GGGGATAPTQ PPATLPPPTT APTLAPVQPT PPPTAAPAPT AAPAGTVTNS LGVTLPENAA PLEHQTFVVY FDITADFTSP NQMETIYKSG GFGSITNLTG DTLVRLNKDF QVQPAAALSW SSDETGKVWT FNLDPNLVWS DGTPVTAEDF VATFRYAADP QHAWDFAWYY SAPGAIKNWD KCVAGELPLE ELGVTAKDKH TLVIETETPA PFLPAKLVYS EVLSAAKLKE YGSGLYTADP EKTISCGPYI LKEFKPRERV VFEINPTYKG TNRPRIERVV QIAARPEAMF AGYQAGEVDR VTGEQLQTAD NEIIARDPEL SKQVRLTAGD FRTDYLFFDC QNPPFNDVRV RQAFSHIVDR DTLIKTIITP TQGIPAYSFL MPGFPASNSE GLKDIQRYDP ELGRALLKEA GYEGGKGFPK LTLWLRNEPQ IRQALAAAIA AAITQEYGIE VEVSNKDFKT FMDALNAKPT QIQFGMVSYG IDFLDPSNML GVWLSTGRHN WFNKKFDEMV LSASESTDPN RIKVFQDAER LLCEEAPAVF IYHRTVADIY KPYVVGECFE PNIAGFSGLQ WPGFTSMSDS LQTMYMSNEV TKYRKAPPR
|
| |