Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rmar_2423 |
Symbol | |
ID | 8569089 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodothermus marinus DSM 4252 |
Kingdom | Bacteria |
Replicon accession | NC_013501 |
Strand | - |
Start bp | 2810668 |
End bp | 2813802 |
Gene Length | 3135 bp |
Protein Length | 1044 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | |
Product | glycosyl hydrolase BNR repeat-containing protein |
Protein accession | YP_003291689 |
Protein GI | 268317970 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGTTACG GTCTCCTGCT CCTGCTGGCG TTTCTGCTGT TGCCCGAGCC CGGCGTGTGG GCGCAGCGAC AGACGCCGCA GCCGCCCCAC GGCTACGATC CGGCCCTGTT CGACACGCTG CACTTTCGCA TGATCGGCCC CTTCCGGGGC GGCCGCTCGA CGGCCGTCAC GGGCGTGCGC GGCCAGCCGC TCGTGCACTA CTTCGGCGGA ACGGGCGGCG GCGTCTGGAA AAGCACCGAC GGCGGCCAGT CGTGGCAGAA CATCTCGGAC GGCTATTTCG GGGGCTCGAT CGGCGCCATC GCCGTGAGCG AGTGGGACCC GAACGTGATC TACGTGGGCG GTGGGGAGGA GTCGATCCGC GGCAACGTCT CCCACGGCTA CGGCATGTGG AAGTCGACCG ATGCCGGCAA GACGTGGACG TTCATCGGGC TTCCCGACAG TCGCCATATC GGTGACATCG TGATCCATCC GCGCAATCCC GACCTGGTCT ACGTGGCCGT CATGGGCCAC GCCTTTGGAC CGAACAGGGA GCGGGGCGTC TACCGGAGCA AAGACGGCGG TAAGACCTGG GAGCAGATTC TGTTCGTGAA TGAAGATGCG GGCGCGGTGG ACCTGGCAAT GGATCCGACG AACCCGCGCA TCCTGTACGC CACGTTCTGG CGCTTCCGGC GCACGCCCTA CAGCTTCGAG AGCGGGGGTG AGGGCTCCAG CCTCTGGAAG AGCACCGACG GCGGCGACAC CTGGGTGGAG CTGACGCGCA ACCCCGGCAT GCCCGAGCCG CCCATCGGAA AGATCGGCAT CACCGTGTCG GCCGACCCGA ACCGACTCTA TGCGATCGTC GAGGCGCGCG AGGGCGGCGT GTTTCGGTCC GACGACGGCG GCAAGACCTG GCGGCGCGTC AACGACGACC GCAATCTCCG CCAGCGGGCC TGGTACTACA GCCACATCGT GGCCGACCCG AAGGACCCGG ATGTCGTCTA TGTGCTGAAC GTGGGCTTCT GGAAGTCGAA AGACGGCGGC CGCACCTTCA CGCGCATCGG CACGCCGCAC GGCGATCACC ACGACCTGTG GATCGATCCG GACAACCCGC AGCACATGAT CATCGCCGAC GACGGCGGGG CGCAGGTCAC CTACGACGGC GGCGAAAACT GGACGACCTA CTACAACCAG CCCACGGCCC AGTTCTACCG CGTCACGACC GACAACGTCT TCCCCTACCG CATCTACGGG GCGCAGCAGG ACAACTCGAC CGTGCGCATC TACAGCCGCT CGGACGGCCC CGGCATCTCC GAACGCGACT GGGAGCCGAC AGCCGGTGGC GAAAGCGGCT GGCTGGCCCC CGATCCCAAA GACCCCGAAA TCGTCTACGG CGGCTCCTAC GGCGGCTACC TGGAACGCTA CGACCACCGC ACGCGCCAGT CGCGCCGCGT GGACATCTGG CCCGACAACC CCATGGGCCA CGGCGCCAAA GATCTGAAGT ATCGCTTCCA GTGGAACTTC CCCATCATGT TCTCGCGGCA CGACCCCAAC GTGCTCTACG CGGCCGCCAA CGTGCTCTTC AAAACCACCA ACGAAGGCCA GAGCTGGGAG CAGATCAGCC CGGACCTGAC CCGCAATGAC ACGACGAAGA TGGGACCCTC GGGCGGACCC ATCACGAAGG ACAACACCAG CGTGGAGTAC TACGGGACGA TCTTCGCGCT GGCCGAGTCG GTCCATGAGC CCGGCGTGAT CTGGACCGGC TCGGACGACG GGCTCATCTA CCTGACGCGC GACGGCGGCA AGACCTGGCA GAACGTGACG CCGCCTCCGT CCATCATGCC CGAGTGGATC CAGATCAACA GCATCGAGCC CGATCCGTTC AACCCGGGCG GCCTCTACGT GGCGGCCACC ATGTACAAGT GGGACGACTT CCGGCCCTAT CTCTACAAGA CCAAAGATTA CGGCCGCACC TGGCAGAAAA TCACCAACGG CATCGCCGAG AATCATTTCA CGCGCGTCAT CCGCGCCGAT CCGGAGCGGC CGGGCCTGCT CTACGCCGGG ACCGAGAGCG GCCTGTACAT CTCCTTCGAC GACGGGGAGC ACTGGCAGCC GTTCCAGCTC AACCTGCCGA TCGTGCCCAT CACGGACCTG GCCGTCAAGG GCACCGACCT GATCGTGGCC ACGCAGGGCC GGAGCTTCTG GGTGCTCGAT CATCTGGAGG TGCTCCGCCA GCTCACGCCC GAACTGGCCC GCCAGGACGT GATCCTCTTC AAACCGAAAG ACACCTACCG GTTGCGGGGC GGCACCTTCG GCGATCCGCC GCCGGGAATG GGCGAAAACC CGCCGCCCGG CGTGGAGATC TTTTTCTACC TGAAGGAAAA GCCCGACACG GCCACGGTTG TGAAGCTGGA GATTCTGGAA GAAGACGGCG ACGTGATCCG CACGTTCGCC ACCAACGCGA AGGAGCGGCG TGACCGGCTG GAAGTGCGGG CCGGCAGCAA CAGGTTCGTC TGGGACATGC GCTACCCGGA CGCCGAGGGT TTCGAAGGGC TGATCATGTG GGCGGCCAGC CTCCGGGGAC CGCTGGCCCC GCCGGGCACC TATCGGGTGC GGCTCACGGT GGGCGATCAG GTGCAGGAGC AGACGTTCCG ACTGCTCAAG GATCCCCGCA GCACGGCCAC CGACGAGGAC TTGCAGGCGC AGTTCGCCTT CCTGATGGAG GTGCGCGACA GGGTCTCGGA AGTGCACCGT ACGATCAAGC ACATCCGGGA AATCCGGCAG GACCTTCGGG AGGTGATCGG GCGGTTGCCG GACGACTCGA CGGGCGCAGC GCTGCGCCGC CAGGGACGGT CCATCATCCA GAAGCTCACG CAGGTGGAAG AGGCGCTCTA TCAGACGAAG AACCGGGCGC CGCAGGACCC GCTGAACTTC CCGATCCGGC TCAACAATAA GCTGGCCGCG CTGATGGGTG TGGTGGGCAC GGGCGACTTC CGCCCCACGG ATCAGGCCGT GGCTGTCAAA AACGAGCTGG TGGCGCAGAT CGACGGCTGG CTGGCCCGCT ACCGGGAAGT GATCGAACGC GAACTACCGG CCTTCAACGA AGCCGTACTG GCGCTGCGGT TGCCACCCGT GGCCGTGCCC GCCGCGACGG AATAA
|
Protein sequence | MRYGLLLLLA FLLLPEPGVW AQRQTPQPPH GYDPALFDTL HFRMIGPFRG GRSTAVTGVR GQPLVHYFGG TGGGVWKSTD GGQSWQNISD GYFGGSIGAI AVSEWDPNVI YVGGGEESIR GNVSHGYGMW KSTDAGKTWT FIGLPDSRHI GDIVIHPRNP DLVYVAVMGH AFGPNRERGV YRSKDGGKTW EQILFVNEDA GAVDLAMDPT NPRILYATFW RFRRTPYSFE SGGEGSSLWK STDGGDTWVE LTRNPGMPEP PIGKIGITVS ADPNRLYAIV EAREGGVFRS DDGGKTWRRV NDDRNLRQRA WYYSHIVADP KDPDVVYVLN VGFWKSKDGG RTFTRIGTPH GDHHDLWIDP DNPQHMIIAD DGGAQVTYDG GENWTTYYNQ PTAQFYRVTT DNVFPYRIYG AQQDNSTVRI YSRSDGPGIS ERDWEPTAGG ESGWLAPDPK DPEIVYGGSY GGYLERYDHR TRQSRRVDIW PDNPMGHGAK DLKYRFQWNF PIMFSRHDPN VLYAAANVLF KTTNEGQSWE QISPDLTRND TTKMGPSGGP ITKDNTSVEY YGTIFALAES VHEPGVIWTG SDDGLIYLTR DGGKTWQNVT PPPSIMPEWI QINSIEPDPF NPGGLYVAAT MYKWDDFRPY LYKTKDYGRT WQKITNGIAE NHFTRVIRAD PERPGLLYAG TESGLYISFD DGEHWQPFQL NLPIVPITDL AVKGTDLIVA TQGRSFWVLD HLEVLRQLTP ELARQDVILF KPKDTYRLRG GTFGDPPPGM GENPPPGVEI FFYLKEKPDT ATVVKLEILE EDGDVIRTFA TNAKERRDRL EVRAGSNRFV WDMRYPDAEG FEGLIMWAAS LRGPLAPPGT YRVRLTVGDQ VQEQTFRLLK DPRSTATDED LQAQFAFLME VRDRVSEVHR TIKHIREIRQ DLREVIGRLP DDSTGAALRR QGRSIIQKLT QVEEALYQTK NRAPQDPLNF PIRLNNKLAA LMGVVGTGDF RPTDQAVAVK NELVAQIDGW LARYREVIER ELPAFNEAVL ALRLPPVAVP AATE
|
| |