Gene Rmar_2423 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRmar_2423 
Symbol 
ID8569089 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodothermus marinus DSM 4252 
KingdomBacteria 
Replicon accessionNC_013501 
Strand
Start bp2810668 
End bp2813802 
Gene Length3135 bp 
Protein Length1044 aa 
Translation table11 
GC content66% 
IMG OID 
Productglycosyl hydrolase BNR repeat-containing protein 
Protein accessionYP_003291689 
Protein GI268317970 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGTTACG GTCTCCTGCT CCTGCTGGCG TTTCTGCTGT TGCCCGAGCC CGGCGTGTGG 
GCGCAGCGAC AGACGCCGCA GCCGCCCCAC GGCTACGATC CGGCCCTGTT CGACACGCTG
CACTTTCGCA TGATCGGCCC CTTCCGGGGC GGCCGCTCGA CGGCCGTCAC GGGCGTGCGC
GGCCAGCCGC TCGTGCACTA CTTCGGCGGA ACGGGCGGCG GCGTCTGGAA AAGCACCGAC
GGCGGCCAGT CGTGGCAGAA CATCTCGGAC GGCTATTTCG GGGGCTCGAT CGGCGCCATC
GCCGTGAGCG AGTGGGACCC GAACGTGATC TACGTGGGCG GTGGGGAGGA GTCGATCCGC
GGCAACGTCT CCCACGGCTA CGGCATGTGG AAGTCGACCG ATGCCGGCAA GACGTGGACG
TTCATCGGGC TTCCCGACAG TCGCCATATC GGTGACATCG TGATCCATCC GCGCAATCCC
GACCTGGTCT ACGTGGCCGT CATGGGCCAC GCCTTTGGAC CGAACAGGGA GCGGGGCGTC
TACCGGAGCA AAGACGGCGG TAAGACCTGG GAGCAGATTC TGTTCGTGAA TGAAGATGCG
GGCGCGGTGG ACCTGGCAAT GGATCCGACG AACCCGCGCA TCCTGTACGC CACGTTCTGG
CGCTTCCGGC GCACGCCCTA CAGCTTCGAG AGCGGGGGTG AGGGCTCCAG CCTCTGGAAG
AGCACCGACG GCGGCGACAC CTGGGTGGAG CTGACGCGCA ACCCCGGCAT GCCCGAGCCG
CCCATCGGAA AGATCGGCAT CACCGTGTCG GCCGACCCGA ACCGACTCTA TGCGATCGTC
GAGGCGCGCG AGGGCGGCGT GTTTCGGTCC GACGACGGCG GCAAGACCTG GCGGCGCGTC
AACGACGACC GCAATCTCCG CCAGCGGGCC TGGTACTACA GCCACATCGT GGCCGACCCG
AAGGACCCGG ATGTCGTCTA TGTGCTGAAC GTGGGCTTCT GGAAGTCGAA AGACGGCGGC
CGCACCTTCA CGCGCATCGG CACGCCGCAC GGCGATCACC ACGACCTGTG GATCGATCCG
GACAACCCGC AGCACATGAT CATCGCCGAC GACGGCGGGG CGCAGGTCAC CTACGACGGC
GGCGAAAACT GGACGACCTA CTACAACCAG CCCACGGCCC AGTTCTACCG CGTCACGACC
GACAACGTCT TCCCCTACCG CATCTACGGG GCGCAGCAGG ACAACTCGAC CGTGCGCATC
TACAGCCGCT CGGACGGCCC CGGCATCTCC GAACGCGACT GGGAGCCGAC AGCCGGTGGC
GAAAGCGGCT GGCTGGCCCC CGATCCCAAA GACCCCGAAA TCGTCTACGG CGGCTCCTAC
GGCGGCTACC TGGAACGCTA CGACCACCGC ACGCGCCAGT CGCGCCGCGT GGACATCTGG
CCCGACAACC CCATGGGCCA CGGCGCCAAA GATCTGAAGT ATCGCTTCCA GTGGAACTTC
CCCATCATGT TCTCGCGGCA CGACCCCAAC GTGCTCTACG CGGCCGCCAA CGTGCTCTTC
AAAACCACCA ACGAAGGCCA GAGCTGGGAG CAGATCAGCC CGGACCTGAC CCGCAATGAC
ACGACGAAGA TGGGACCCTC GGGCGGACCC ATCACGAAGG ACAACACCAG CGTGGAGTAC
TACGGGACGA TCTTCGCGCT GGCCGAGTCG GTCCATGAGC CCGGCGTGAT CTGGACCGGC
TCGGACGACG GGCTCATCTA CCTGACGCGC GACGGCGGCA AGACCTGGCA GAACGTGACG
CCGCCTCCGT CCATCATGCC CGAGTGGATC CAGATCAACA GCATCGAGCC CGATCCGTTC
AACCCGGGCG GCCTCTACGT GGCGGCCACC ATGTACAAGT GGGACGACTT CCGGCCCTAT
CTCTACAAGA CCAAAGATTA CGGCCGCACC TGGCAGAAAA TCACCAACGG CATCGCCGAG
AATCATTTCA CGCGCGTCAT CCGCGCCGAT CCGGAGCGGC CGGGCCTGCT CTACGCCGGG
ACCGAGAGCG GCCTGTACAT CTCCTTCGAC GACGGGGAGC ACTGGCAGCC GTTCCAGCTC
AACCTGCCGA TCGTGCCCAT CACGGACCTG GCCGTCAAGG GCACCGACCT GATCGTGGCC
ACGCAGGGCC GGAGCTTCTG GGTGCTCGAT CATCTGGAGG TGCTCCGCCA GCTCACGCCC
GAACTGGCCC GCCAGGACGT GATCCTCTTC AAACCGAAAG ACACCTACCG GTTGCGGGGC
GGCACCTTCG GCGATCCGCC GCCGGGAATG GGCGAAAACC CGCCGCCCGG CGTGGAGATC
TTTTTCTACC TGAAGGAAAA GCCCGACACG GCCACGGTTG TGAAGCTGGA GATTCTGGAA
GAAGACGGCG ACGTGATCCG CACGTTCGCC ACCAACGCGA AGGAGCGGCG TGACCGGCTG
GAAGTGCGGG CCGGCAGCAA CAGGTTCGTC TGGGACATGC GCTACCCGGA CGCCGAGGGT
TTCGAAGGGC TGATCATGTG GGCGGCCAGC CTCCGGGGAC CGCTGGCCCC GCCGGGCACC
TATCGGGTGC GGCTCACGGT GGGCGATCAG GTGCAGGAGC AGACGTTCCG ACTGCTCAAG
GATCCCCGCA GCACGGCCAC CGACGAGGAC TTGCAGGCGC AGTTCGCCTT CCTGATGGAG
GTGCGCGACA GGGTCTCGGA AGTGCACCGT ACGATCAAGC ACATCCGGGA AATCCGGCAG
GACCTTCGGG AGGTGATCGG GCGGTTGCCG GACGACTCGA CGGGCGCAGC GCTGCGCCGC
CAGGGACGGT CCATCATCCA GAAGCTCACG CAGGTGGAAG AGGCGCTCTA TCAGACGAAG
AACCGGGCGC CGCAGGACCC GCTGAACTTC CCGATCCGGC TCAACAATAA GCTGGCCGCG
CTGATGGGTG TGGTGGGCAC GGGCGACTTC CGCCCCACGG ATCAGGCCGT GGCTGTCAAA
AACGAGCTGG TGGCGCAGAT CGACGGCTGG CTGGCCCGCT ACCGGGAAGT GATCGAACGC
GAACTACCGG CCTTCAACGA AGCCGTACTG GCGCTGCGGT TGCCACCCGT GGCCGTGCCC
GCCGCGACGG AATAA
 
Protein sequence
MRYGLLLLLA FLLLPEPGVW AQRQTPQPPH GYDPALFDTL HFRMIGPFRG GRSTAVTGVR 
GQPLVHYFGG TGGGVWKSTD GGQSWQNISD GYFGGSIGAI AVSEWDPNVI YVGGGEESIR
GNVSHGYGMW KSTDAGKTWT FIGLPDSRHI GDIVIHPRNP DLVYVAVMGH AFGPNRERGV
YRSKDGGKTW EQILFVNEDA GAVDLAMDPT NPRILYATFW RFRRTPYSFE SGGEGSSLWK
STDGGDTWVE LTRNPGMPEP PIGKIGITVS ADPNRLYAIV EAREGGVFRS DDGGKTWRRV
NDDRNLRQRA WYYSHIVADP KDPDVVYVLN VGFWKSKDGG RTFTRIGTPH GDHHDLWIDP
DNPQHMIIAD DGGAQVTYDG GENWTTYYNQ PTAQFYRVTT DNVFPYRIYG AQQDNSTVRI
YSRSDGPGIS ERDWEPTAGG ESGWLAPDPK DPEIVYGGSY GGYLERYDHR TRQSRRVDIW
PDNPMGHGAK DLKYRFQWNF PIMFSRHDPN VLYAAANVLF KTTNEGQSWE QISPDLTRND
TTKMGPSGGP ITKDNTSVEY YGTIFALAES VHEPGVIWTG SDDGLIYLTR DGGKTWQNVT
PPPSIMPEWI QINSIEPDPF NPGGLYVAAT MYKWDDFRPY LYKTKDYGRT WQKITNGIAE
NHFTRVIRAD PERPGLLYAG TESGLYISFD DGEHWQPFQL NLPIVPITDL AVKGTDLIVA
TQGRSFWVLD HLEVLRQLTP ELARQDVILF KPKDTYRLRG GTFGDPPPGM GENPPPGVEI
FFYLKEKPDT ATVVKLEILE EDGDVIRTFA TNAKERRDRL EVRAGSNRFV WDMRYPDAEG
FEGLIMWAAS LRGPLAPPGT YRVRLTVGDQ VQEQTFRLLK DPRSTATDED LQAQFAFLME
VRDRVSEVHR TIKHIREIRQ DLREVIGRLP DDSTGAALRR QGRSIIQKLT QVEEALYQTK
NRAPQDPLNF PIRLNNKLAA LMGVVGTGDF RPTDQAVAVK NELVAQIDGW LARYREVIER
ELPAFNEAVL ALRLPPVAVP AATE