Gene Rmar_0142 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRmar_0142 
Symbol 
ID8566767 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodothermus marinus DSM 4252 
KingdomBacteria 
Replicon accessionNC_013501 
Strand
Start bp154927 
End bp157977 
Gene Length3051 bp 
Protein Length1016 aa 
Translation table11 
GC content63% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003289438 
Protein GI268315719 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.335318 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATCCTG AAATAGCTAC CGAAGCCTCT GTATCGTTTG ATCGGGTGGT GGACGACATT 
GCGGCCGAGG TTGCCGCCGG TTGGCGAACC TGGCAGCAGG CCGAACAGCG GCGGTGGGCG
CAATGGGAAG CGATCGTTCA GGAAGCTGCG CAGGCGCACG GCGAAGCGCT GGCCTTGCTG
GAAAAGGGAG ATAGGCAGGG CTATGCAGCC CTTGTCGAAG CGCAGGTGCT GCGTCCATTG
CAGCAGGCCT GGCATGCGCA GGTTTTGCCG GAAGGGATTC CGCCAACTCC TTCCTTTGAA
GCGCTGGGGC CGGAGCGTTG GCCGGAGCAG GTGCGCGTGC CAGTGACACC TGAACTATTC
GCGGATGAAG CCAAAGGTTG GGTCGGACGC CGACTGCGGT GTCGAATAGC CCGTCGGATT
TTGACGCTAT GGAATCGGCG CTGGCCGCAA CGACCTTGCA GGCGTACCGT CCCGCTGCGG
CTTCTGCTGG CATTTCACCT GGAGGTCCGT GTACCCCGAG CCTGGTGGCC GGTCTATGAA
GCCTGTCGGA CCCACTGGGC TCGGGCGGTC GGTCTGGTGG AGAAAGGGCT GGCCACCTGG
CAGGAAACGG TACGGACAGC GTTGCCGCTC GACGCGGAAG CGGCACCCTC TGCGGAGGTT
CTGGAAGGCG CACGGGCTTT GCAGGAAGTG CTGGTAGCGG CAATTCAGGA GCGGCTCCCG
GATCTTTCCA TTCCGGAAGC GTCGCTTGCG AAAGAACTGG TCCGGGATCT GGAGCGAGCC
GGGACGTTTC TGCTGCGTCG GCGTCATCGG CGGTTGCCAC GAAAAAAACG TCGCTACCAC
AGTCGCTGGG AACGCATCGA AAGCGCCTGG CAACAGTGGT TTCGGCAGGC TGCTGACCGT
GCCGCGCTGT TAGCGGCACT GGTACAGGTG ACTTGTCAGC TGCGAGCCCA ACAGGATCGG
CTTGCTGACC GACTGGAGCA GGCCGTTCGC ATGCCGCGCG CCCGTGTGGT CGAGCAGTTC
AGGAGCTACT TCGATCGGGT TCAGGAACGA TTGGCGCAAG CGCTGGCCGA GGAAACAGAT
GACCCTGAAA GGCTGCGTCA GCATCTGCAG GAAGCCCTGA CCGCTCGGGA ATCACTGCGC
CAACAGCTTT ACCAGCAGCC TGACCTGAGG CAACTGAGCA CCGTGCTGGA GGCCCCGGCA
CGCGACGAAT GGGCGGCGGT TACGGCGACC GTGCAGACCC TCCCCGAGCA GTGTGTACTG
CATCCTGCGG GTCGGACGCT CCGACCGGAG GGTCCGGCAC TGACGATCCC ACTCAGGGCG
CTGGTACTGG AACACCTCGA GCCGCCCTGG CCCGAGCACC TGGAGGCCGC GTCGTTGCGC
TTGCGTCAGG CGGTGCTGAA AAGCTGGAGT GCGCTCGAGG AGGTTTTGGA GATTGTCGGC
TACAATCTGG AGGCGGCCCT GGACGAGATA AGGGCGGAGG CGCCCGGTGG AGACCTGCGC
ACCCGGCTCG AGGAGCTGGC ACTCGGGAGT CTGCAGCATG CCGGAAGACG CCTGGAAGAA
AGTACCCGCG AGCTGGAAGA AGCGCTGGGT GCTTTTCGGG AAGCCCTGGC CGAGGAGATC
CGCAGCGACT GGCGCACACT GGAGCAGCGA CTTCAGGCTG AGATGGCGCA GCGGGCTCAG
TGGAAGCGGC TGCGCCTGCG TCTGCAGCAC CAGGCCGGTC AGTGGCTGTA CCTGGCCCAC
CAACAGGGGA AGCGACAGTG GAAGGCCCTG CAGGCTGCTT CGCGACGAAT CCTGCGCCGG
GCGCGCCAGC TTGTGCGGTG GGGCCAGGCG GCAATCGGGA TGGGTGCTGA CACTGGCGCG
ATCGATCAGG TGCGTGCCGC CGACGTGCTG CTCAGCATCG AGGAGGTGCG CGCGCGGCTC
CCTCTGGTCT ATCGACGACT ATTCACGTTC GAGCCCCTGG ACGATCCAGC TTTGTTTGTG
GGATTTGAGC TCGAGCGCCG TCAGATCGCC GGCTGGTACG GTCGCTGGTG CGAGGGTCGT
AGCAGTAGTG CCGTGGTGGT GACGGCTTTT CCGGGAACAG GGATGACCAG CATGCTCAAC
GTGCTGACAG CCACGGTCTT TGCCGAAGCT CGTGTCTGTC GATTGACGTT GCAGGAGCGG
GTGCGGGATG AAGCGCATCT GGCCGTACTC CTGGCGCAGG CGCTCCAACT CCCAGAAACG
CCGGACACCC TTGAGCGCGT CGAAGCAGCT ATTGGACAAC GTTTTGCTCG CGAAAAGCCA
ACCGTCGTGT TGCTCGACAA CCTGGAGCAT GTGTTGCTAT GCACCTACAA TGGCCAGCAG
TGGCTGGAGC GTCTGCTGAT TCTCTTTGCC CGAACGGATC GGCATGTATT CTGGGTGGCC
GGGATTGCCC GGCCGGCCTG GTTCTTCTTT GAACGCACGG CCCGGAATGC GGTGGGGCTG
GTGCAGCTCT GTCCGCTCCG GGAGCCGGAC CGGGCACTTC TGGAGCAGGC CATCGAAGCG
CGACATCTTC GGAGCGGGCT ACTGCTGCGT TTTGAGCCGC CGGCACGACC TTCGCCCGTG
CTACGTCAGC GGTTGCGTCG GGCCAGTACG CCCGAGGCAC AACAGGCTAT TCTGCGCGAG
GTGTTCTTCG ATCGGCTGTA TCAGGAAGCC GGTCCGAATT GGCGACTGGC GTTGCTCTAC
TGGCTGCGCT CGGTTCAGGT GGAGGATGGC GGTCTGCGCG TGCGCCCGAT CGCTGCGCTG
ACCTTTGACT TTCTGGAACA GCTCAGCCTC GAACAGGCCT TCACGCTCAA AGCATTTCTG
CGACATCGCA CGCTGACGCT GGAAGAACAT CAGCAACTGT TCCGCAGCAC ACCGGCTCAG
AGCCTTTTTG TGCTGGAGTC GCTGCTGAAT CAGCATCTGA TTGAACCAGA GAAAAAAGAG
GAAGCACCGA TGGAAGGGCT GCAGCCGGGC GTTCGCTATC GGCTGGTGCC GTTCTTCGTG
CAGCCGGTGC GACGGGTGCT CCAGACCCGG CACATCCTGT ACGAAGGCTA A
 
Protein sequence
MNPEIATEAS VSFDRVVDDI AAEVAAGWRT WQQAEQRRWA QWEAIVQEAA QAHGEALALL 
EKGDRQGYAA LVEAQVLRPL QQAWHAQVLP EGIPPTPSFE ALGPERWPEQ VRVPVTPELF
ADEAKGWVGR RLRCRIARRI LTLWNRRWPQ RPCRRTVPLR LLLAFHLEVR VPRAWWPVYE
ACRTHWARAV GLVEKGLATW QETVRTALPL DAEAAPSAEV LEGARALQEV LVAAIQERLP
DLSIPEASLA KELVRDLERA GTFLLRRRHR RLPRKKRRYH SRWERIESAW QQWFRQAADR
AALLAALVQV TCQLRAQQDR LADRLEQAVR MPRARVVEQF RSYFDRVQER LAQALAEETD
DPERLRQHLQ EALTARESLR QQLYQQPDLR QLSTVLEAPA RDEWAAVTAT VQTLPEQCVL
HPAGRTLRPE GPALTIPLRA LVLEHLEPPW PEHLEAASLR LRQAVLKSWS ALEEVLEIVG
YNLEAALDEI RAEAPGGDLR TRLEELALGS LQHAGRRLEE STRELEEALG AFREALAEEI
RSDWRTLEQR LQAEMAQRAQ WKRLRLRLQH QAGQWLYLAH QQGKRQWKAL QAASRRILRR
ARQLVRWGQA AIGMGADTGA IDQVRAADVL LSIEEVRARL PLVYRRLFTF EPLDDPALFV
GFELERRQIA GWYGRWCEGR SSSAVVVTAF PGTGMTSMLN VLTATVFAEA RVCRLTLQER
VRDEAHLAVL LAQALQLPET PDTLERVEAA IGQRFAREKP TVVLLDNLEH VLLCTYNGQQ
WLERLLILFA RTDRHVFWVA GIARPAWFFF ERTARNAVGL VQLCPLREPD RALLEQAIEA
RHLRSGLLLR FEPPARPSPV LRQRLRRAST PEAQQAILRE VFFDRLYQEA GPNWRLALLY
WLRSVQVEDG GLRVRPIAAL TFDFLEQLSL EQAFTLKAFL RHRTLTLEEH QQLFRSTPAQ
SLFVLESLLN QHLIEPEKKE EAPMEGLQPG VRYRLVPFFV QPVRRVLQTR HILYEG