Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rmar_0439 |
Symbol | |
ID | 8567073 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodothermus marinus DSM 4252 |
Kingdom | Bacteria |
Replicon accession | NC_013501 |
Strand | - |
Start bp | 483558 |
End bp | 485276 |
Gene Length | 1719 bp |
Protein Length | 572 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | |
Product | hypothetical protein |
Protein accession | YP_003289729 |
Protein GI | 268316010 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00635902 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGGCGCT GGCTGCTTTC GATGCTGTTC CTGAGCGTGG GGGCCATCCC GGCGGCCGCG CAGCGCTGGA CGGTAGAAGT TCAGGCCCGC GTCAGTCGCA CCCAGGTGGG CATGGGCGAG ACGGTCGTCT ACACCGTCGA CATCATCGGC GCCCTGCCCG ACGGATTGCG CATGCCGGAG CCGCCCCCGA CCGAAGGGCT GGCGCTCGTC TATCCGACGC CCACCGTACA CCGCCAGTAC TCGCTGATCG GCGGGCGCAT CCAGCAACGT ATCCGCTACA GCTGGCGGTT TCGGCCGGTG CGTCCCGGCA AAGCCCGCAT CGGCGCCTTC CGGCTTCAGG TGGGCGATCA GACGCTGGCC ACCGATCCGA TCGAACTGAC CGTCTACGAT GCACCCACGG CACCGCCCGC CGAAGTGGCG CCGAACGCAT CCGCTGCACC CGACCTGTTT ATCGAGGCCA CCCTGCAGCC CACCATCCCC TACGAAGGGC AACAGACCGT GCTCGAATAC CGACTGTTCT TCCGGGAAGG GCTGCAGGTC TGGCGTAGCC GCCTGGTGGG CTCCTGGGAA ACGGAAGGGT TCTGGCGCGA GGAACTTGAG GTGGAAGATC GGCCGCTGCC CGAGCGGGTA CTTCGCAACG GCGCGGCCTA TTACACCTTC GTGCTGCAGC GCATCGCGCT GTTTCCCACG CGGAGCGGAA TGCTTCAGAT CGATCCGCTC ACCATCGAAA GCGAAGTCTC GCTGCAGCCG GATCCGCTCG ATCCCTTCGC CTCGCTGCTG CGTCCGGGTC CGAGCGAACC GCAGCGCGTG ACGGCTCCGG CGCTCACCGT GACGGTGCGT CCGCTCCCGG ACGGAGCGCC GTCGGGTTTC GTCGATGCCG TGGGACGATT CCGGCTGGAA GCCGAAGCCT CACCTACCGA AGTAAGTGTC GGCGAGCCGG TAACCGTCAC GATCCGTCTG TCCGGATCAG GAAACCTGCC CACGCTGACG CCGCCCCGGC CGGACGTGGA CACCAGCTTC GCCGTCTATC CCGCCGACGA TGCGCTGACG CTCGACCGCC GCGGCGCGCG GCTGACCGGC ACCCGCACGC TGTCCTACGT GCTCGTGCCC GGCCGGGCCG GGCGTTTCAC GCTGCCACCC GTGCGCCTCG CCTACTTCGA TCCGGAGGGC GGCGTCTATC GCACGCTGAC GGCTCCGCTG CCGGAGCTTG TCGTGCATCC GGCCTCGGCC ACGCCCACTG CTCCGGTGGC GACCGCTTCC CCGGCGGCCG TTCATTCTCG GTCATTTTCA TGGATCCGGT GGCTGCCCTA TGCCGGCGGA GGGCTTCTGG TGATCCTACT GCTGCTGTTG CTGGCGCAGC GACGGCGTGC GGCCCGCCGC CCGCGACCGT CGCCCGTGCA ACCCGACGCG CTTCCGCCGT CCACGCTGGA GCCCCGCCTG TTCTACCGGC GACTGGAAGA AACGCTGCGA CGAGCGGCCG GACGCTACCT GGGCGAAGAC GTGCGCGGCC TGACGCGCTC CCGGCTGTGC GACCGCCTGC GCCAACGCGG CCTGCCCGAA GCGGAAGTGG CCCGCATGGC CCGATTGCTG GCCGCCTGCG AGGCCGCCTG CTACGCCCCG ACGCCTCCCG ATCCCATCCA GACGCTCCAC GACCACCGCG AGGCTGCCCG TCTCCTGGAG TTCCTCGAAA AAAAGGCCCC TTCCGAAGAA GGGGCCTGA
|
Protein sequence | MRRWLLSMLF LSVGAIPAAA QRWTVEVQAR VSRTQVGMGE TVVYTVDIIG ALPDGLRMPE PPPTEGLALV YPTPTVHRQY SLIGGRIQQR IRYSWRFRPV RPGKARIGAF RLQVGDQTLA TDPIELTVYD APTAPPAEVA PNASAAPDLF IEATLQPTIP YEGQQTVLEY RLFFREGLQV WRSRLVGSWE TEGFWREELE VEDRPLPERV LRNGAAYYTF VLQRIALFPT RSGMLQIDPL TIESEVSLQP DPLDPFASLL RPGPSEPQRV TAPALTVTVR PLPDGAPSGF VDAVGRFRLE AEASPTEVSV GEPVTVTIRL SGSGNLPTLT PPRPDVDTSF AVYPADDALT LDRRGARLTG TRTLSYVLVP GRAGRFTLPP VRLAYFDPEG GVYRTLTAPL PELVVHPASA TPTAPVATAS PAAVHSRSFS WIRWLPYAGG GLLVILLLLL LAQRRRAARR PRPSPVQPDA LPPSTLEPRL FYRRLEETLR RAAGRYLGED VRGLTRSRLC DRLRQRGLPE AEVARMARLL AACEAACYAP TPPDPIQTLH DHREAARLLE FLEKKAPSEE GA
|
| |