Gene Information |
Locus tag | RoseRS_1784 |
Symbol | |
ID | 5208743 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus sp. RS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009523 |
Strand | + |
Start bp | 2202532 |
End bp | 2205528 |
Gene Length | 2997 bp |
Protein Length | 998 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 640595392 |
Product | hypothetical protein |
Protein accession | YP_001276124 |
Protein GI | 148655919 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
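The coordinate and length fields above are consistent with one another. A minimal sketch of the arithmetic, assuming the start and end positions are 1-based and inclusive and that the reported gene length includes the stop codon (both are assumptions about this record's conventions, and the function names are hypothetical):

```python
# Values copied from the record above.
START_BP = 2202532
END_BP = 2205528


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa, assuming the CDS ends with one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 2997
print(protein_length(length_bp))  # 998
```

Under these assumptions the computed values match the Gene Length (2997 bp) and Protein Length (998 aa) fields.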
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.78034 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.114546 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTCCCGA CGACATTACT GCCGTTTCGT CGGTTGCTTG CCCCGCTCGT CGGCTACGCA ATGATGGTTG TTGCCGGTTA TTGGAGTGTT CGCACAGTTG TCGGACTGAA TAGTGCGCTG CCGGTGCTTC TTATCTTCTG CACTATGGTC AATATCCTGG CGTGGCGTCG CACCGGTTCT CCGTGGTCTT GTCGGTTGCC GGTTCAGATG ACGCGCAGCG AAGGAGTGGC GCTGTTCCTT CTGATTCTGG CGACGATTGG CGTGGGGGTG TTCCCGCTGA TCCACTACAA CCAGCCTGCT GCGATCGGCG ATGGGTGGGA TATCGAGGTG GCACTGCCGA TAGCGCGCTA TCTGGAGCGC GTCCCGGTCG CAGCGATTGC GACAATGCCC GATAACCCGC TGCGTGATCT GGTCGCCCAT CCGCCTCGAA TTTCGCATAA TATCGGGTTC GCCATCTGGC AGGGGTATGT CGATCTCCTG GGAGGGTTTG AGGCGTTCGT CTCGTTTACT CCGTTGATCG CCTGGTTGCG GGGATTGGGC ATTCTGGCGA TCTATGCGCT GTTGCGCATC TGGCTTGGAT TGCCAGCCTG GACAGCGCTG GCTGGCGCCG GTCTGGCAAG CCTGAATGCG CTCCTTCTCT GGGTGTCCTA TTTCAACTTC GAGAAGCAAC TTGCCGGGTT TCCGTTGATC CCGCTGGGTC TGGTCATTGG GGCGGCTGCG GTTGAAGAGA TCGCACGTTA CCGTCTGGCA GCGTGGCGAA GCGCATTCCT GGCGGCCGTA GTCCTGTCCG CTCTGCCGGT CACCTACTAT CCGGCGATCA CGGTGTGGGC GGCGCTGGCA GCGGGGATGG GCGTGGTGCG TCTGATCGAA GCGCGCAGGA AACCTGCCGA GGCGCCGTCG CCGTGGATGC TGATACGTGC AGCCGCTGCA CTGCTGGTAC TGACCCTCAT GATCGCTGCG CCAACGGTCG AAGACTATCT CAATGGTTTT GGATTTCGCT ACAGTCATCA GGTGACATCG CTTGGCATAT TCGACTATAT TCCGGTCAGC GTTATTGTTG GATTGGAACC GTTTCTTCTG AGTCGCAGCG GATCGGTTGC GCCGGACAGC GTGGTGTATG CTGGCGGTCT GGCGCTGGGC CTGCTTGTGG CGGGTGCGCT CGCGTTCGGA CCACTTCGCC TGCGACTGGC GGGGTTGCTC ATCGGAGGGA TCGTCTATCT GGCCTGGTTG CGGTGGTGGC AGGCATATCC ATACGGCTAC ATGAAAGGCG CCGCGTATGT TGGTTTTGTT TTTTCCGCAC TGGCGGCAGC CGGAATCCAG GGGCTCCGCA GGTGGATAGC AGAACGATGG AACAACAACA TTATTCAGCG GGTCGGCGCT CACACCGCGT TGATCGTGAT TGCGACAGGT TTATGTGCGC TTATGGGGGT CAATCAAGCG CAGGTGGTGG TTGCTCACCT CGATCAACCC GGTCTCTACC CTGACGACGC TCCAACATTG CTCGCATTGC GTCAGATTAT CCCGCCCGGA AGCACGGTCA CGCTGACGTC TGATCAACGG GTACAGGGCG TTATCAGTGG GTTTGCTGCG TATGCGCTCG ACCATACGGT GGTGTGGGGG CATGTACGCA CGGGATACAC CAGGTCTCAA ACAGGCGATA TCGATGCTAT TGGTGAGTAT GGGTTACTCT ATGCTTCTGA AGATCCTCTG CTATGGGGGT ACACGCAGCC TCCAATCTGG CGCGGCGGTT CGTATGCGCT CTATCGTCGC CCGCCGGAGG TGCAGCGCCA TCTTCGCGTG 
CTGAAACCGC TGGTGCCAGG TGAAACGCTG ACGCTGCGTA TGAATGTCGA GCAATGGGAG GCGTCGTCCG TCACCGGTTC CGCTTTTCGC TCACTGCGAT TGATGGTCGC TTCTCTTGCG CCCGCCGCCA TCGAGATCAA TGGCATTCCC ATAGCAATAC CTCCAGGACG CCATACGATG ACCCTTGCTG TTCCGCCCTT CCAGGAAGTG AGGATTCGTC ACGTTGATGG CGCCCTGCCT CTCATCGAGA CGATCACGCT GCTGGCAGAT CCTGATCCGA ACACGATCCA GGCTATGAGA GTCGTTCACA GTACTCAACC GACAGGCGGC CCTCTGATGC GGGAGGTGAC AGGGATCGCT CTGGTCCAGG CATCGGCGGT TGCTTCTGAT ACGCATATCC TGATGACTCT GGCAGCGCTT CTGCCGGATG CTGGACCGCT GAACGTGGCG CTCGATATCT GGGATGTTGA GCGGGGCGTC CAGTATGGAT GGTATGGGCT GCTTGTTATG CCTGAGCCGG AGGTGCAGCG TTTCTCACTG TTCCTGTCGC TTGCTGATGG ACAGATGCGC GGAGTATCCG CTCAGGGCGG CGACGTGCCG CTGGGCGCGT ATTTTGCCGG GTTGCAACCT GGACGATACA CCGCCCGCCT GTATCTTGCT GCCAGTGCTC AGGTGGTGAG TGAGCCAATC GATCTGTTTG GATTTGATAT CACGTCTGAT CGCGCAATGA CGAATGTATG GACACGGGAT CATCAGATGC AGGCGATCCG CGCGATCCAT CCGACGACGT TCATCAATGT CCGGGTTGCC GATGACGTGG CGATGGTCGG ATACACTCTG CTGCCAGCGC GCCCCAAACC GGGAGATACA GTCGACCTGA TCATCTGGTG GCGCTCACTG CGCGATGGTC TGGATGAGCG CAGCGTGCTC GTGCATCTTG TTGATGCCGC CGGCACGAAA CGTGCGCAAG CGGACGGTCC GCCTGCCGCA GGAACGATGC CAACCGGGAA ATGGCGCGCC GGACTGACGA TTGTTGATGC GCGGCGCCTC ACCCTCCCGG TCGATCTGCC ATCCGGCGAC TATACGCTTT TGGTGGGGAT GTACCGCTGG CCCTCGCTCG AACGCCTGCC GCTGGTGCAG GGAAATGAGT TGCTTCCCGA AGCGGTCTTC CGGGTTCCTG TGGCAATTGG GGAGTGA
|
Protein sequence | MLPTTLLPFR RLLAPLVGYA MMVVAGYWSV RTVVGLNSAL PVLLIFCTMV NILAWRRTGS PWSCRLPVQM TRSEGVALFL LILATIGVGV FPLIHYNQPA AIGDGWDIEV ALPIARYLER VPVAAIATMP DNPLRDLVAH PPRISHNIGF AIWQGYVDLL GGFEAFVSFT PLIAWLRGLG ILAIYALLRI WLGLPAWTAL AGAGLASLNA LLLWVSYFNF EKQLAGFPLI PLGLVIGAAA VEEIARYRLA AWRSAFLAAV VLSALPVTYY PAITVWAALA AGMGVVRLIE ARRKPAEAPS PWMLIRAAAA LLVLTLMIAA PTVEDYLNGF GFRYSHQVTS LGIFDYIPVS VIVGLEPFLL SRSGSVAPDS VVYAGGLALG LLVAGALAFG PLRLRLAGLL IGGIVYLAWL RWWQAYPYGY MKGAAYVGFV FSALAAAGIQ GLRRWIAERW NNNIIQRVGA HTALIVIATG LCALMGVNQA QVVVAHLDQP GLYPDDAPTL LALRQIIPPG STVTLTSDQR VQGVISGFAA YALDHTVVWG HVRTGYTRSQ TGDIDAIGEY GLLYASEDPL LWGYTQPPIW RGGSYALYRR PPEVQRHLRV LKPLVPGETL TLRMNVEQWE ASSVTGSAFR SLRLMVASLA PAAIEINGIP IAIPPGRHTM TLAVPPFQEV RIRHVDGALP LIETITLLAD PDPNTIQAMR VVHSTQPTGG PLMREVTGIA LVQASAVASD THILMTLAAL LPDAGPLNVA LDIWDVERGV QYGWYGLLVM PEPEVQRFSL FLSLADGQMR GVSAQGGDVP LGAYFAGLQP GRYTARLYLA ASAQVVSEPI DLFGFDITSD RAMTNVWTRD HQMQAIRAIH PTTFINVRVA DDVAMVGYTL LPARPKPGDT VDLIIWWRSL RDGLDERSVL VHLVDAAGTK RAQADGPPAA GTMPTGKWRA GLTIVDARRL TLPVDLPSGD YTLLVGMYRW PSLERLPLVQ GNELLPEAVF RVPVAIGE
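The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is illustrative only: it builds the codon table by hand, with no external library, and translates just the first 30 bp of the gene sequence. Translation table 11 uses the same amino acid assignments as the standard genetic code and differs in which start codons it permits, so the standard assignments are used here. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Standard codon assignments, ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace is ignored so the 10-base blocks used in the record
    can be pasted in directly.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
print(translate("ATGCTCCCGA CGACATTACT GCCGTTTCGT"))  # MLPTTLLPFR
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to the same function should reproduce all 998 residues if the record is internally consistent.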