Gene Rmar_0081 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRmar_0081 
Symbol 
ID8566706 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodothermus marinus DSM 4252 
KingdomBacteria 
Replicon accessionNC_013501 
Strand
Start bp88795 
End bp91740 
Gene Length2946 bp 
Protein Length981 aa 
Translation table11 
GC content67% 
IMG OID 
Productexcinuclease ABC, A subunit 
Protein accessionYP_003289378 
Protein GI268315659 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAGGACT GCATTGTCAT CCGAGGCGCA CGCGAGCACA ACCTCAAAAA CATCGACCTG 
GACATCCCCC GGGAGCGCCT GGTGGTGATC ACCGGCCTTT CGGGCTCGGG CAAGTCGAGC
CTGGCCTTCG ACACGATCTA CGCCGAAGGC CAGCGGCGCT ACCTGGAGAG CCTTTCGGCC
TACGCCCGCC AGTTTCTGGG CATGCTCGAG CGGCCCGACG TGGACTTCAT CGACGGGCTG
TCGCCCGTCA TCGCCATCGA GCAGAAGACG GTCAGCCAGA ACCCGCGCTC GACGGTCGGC
ACCGTCACCG AGGTGTACGA TTTTCTGCGG CTGCTCTATG CCCGCGTCGC CACGGCCTAC
TCGCACATCT CGGGCAAGCC CATGCGCCGC CAGACCGACG ACGAAATCAT CAACCACATC
CTGAGTTTTC CGGAAGGCAC CCGGCTGCTG ATCCTGGCGC CGGTGGTGCG CGGCCGCAAA
GGGCACTACC GGGAGCTGTT CGAGCAGATC GCCCGCCAGG GCTTCGAGCG CGTGCGCGTC
GACGGCGAGC TGCGGGAAAT CACGCCGGGC ATGAAGCTTG ACCGCTACAA GACGCACGAC
ATCGAAGTCG TCGTCGACCG GATCGTGGTG CGGCCCGACA TCCGGCCGCG CGTGCACGAT
TCGGTCCAGA TCGCCCTCGG CATGGGCGGC GGCGTGCTCA TCGCCCACGT GCTCAACCCC
GACGGCATGG GCACCGACCA CGTCTTCAGC CGCCACCTCT ACTGCCCCGA AGACGGCGTC
TCCTACGACG ATCCCTCGCC GAACACGTTC TCGTTCAACT CGCCCTACGG CGCCTGTCCG
GCCTGCAACG GCCTGGGCAC CCGGCTGGAA ATCGACCCGG ATCTGGTCAT CCCCGACCCG
TCGAAGTCGA TCAACGAAGG AGGACTGGCG CCGCTGGGCA GGCCGCGCGA CGTCTGGATC
TTCAGCCAGC TCCGCGCCGT GGCCGCGGCC TACGGGTTCG ACTTCGACAC GCCGCTGGGC
GCGCTCACCG AGGAGCAGCG GCGCGTGCTG CTGGAAGGCG CCGGCGACCG TGAGTTCGAG
ATCGAATACC GCTTCAAGGA TCGCACGCTG CGCTACCGCC ACCGCTTCGG CGGCATTTAC
GAATACCTGC ACCACCTGCG CGAGCATGCC GGGTCGGTCT CCCAGCGGCG CTGGGCCGAG
TCGTTCATGC GCGAGATGAC CTGCCAGGCC TGCGGCGGCG CCCGCCTGAA GCCCGAAAGC
CTGGCCTTCC GCATCGGCGG CAAGAACATC GCCGAGCTGG CGCACATGGA CCTGGTCTCG
CTGCGGCGCT TTCTGTCGGA GCTGCAGTTC GAGGGCCAGA AGGCGCTCAT CGCCCGTCCC
ATTCTCAAGG AAATTCTGGA GCGGCTCGGC TTTCTGATCG AAGTCGGCGT GGGCTATCTG
ACGCTGGACC GTCCGGCGCG CACGCTCTCG GGTGGCGAAA GCCAGCGCAT CCGGCTGGCC
GCCCAGCTCG GCTCGCAGCT GACCGGCGTG CTCTACGTGC TCGACGAGCC CTCGATCGGC
CTGCACCCCC GCGACCATCA CCGGCTCATC GACGCGCTCA AGCTGCTGCG CGATCTGGGC
AACTCGATCC TCGTCGTCGA GCACGACCGC GAGATGATCG AGGCGGCCGA CTACGTGATC
GATCTGGGTC CGGGCGCGGG CGAACACGGC GGCCATGTGG TGGCGGCGGG ACCGCCCGAC
CGCCTGGAGG TGCGCGACGG CCACGTCAGC CTGACGGCCG CCTACCTGAA GGGTGCGCGC
TACATCCCGG TACCCCGCGA GCGGCGCACC GGCAACGGCC ACCGGCTCGT GCTCTACGGC
GCCCGCGGCC ACAACCTGAA GGGCGATCCG CTCGTGCTGC CGCTGGGCAC CTTCATCTGC
GTGACGGGCG TTTCGGGCTC GGGTAAGTCG AGCCTGATCA ACCAGACGCT CTACCCGATC
CTGGCCCGCC ACTTCCACAA CGCCCGCCTG GTGCCGCTGC CTTACGAGCG CATCGAGGGG
CTCGAGCATC TGGACAAGGT GATCGACATC GACCAGAAGC CCATCGGCCG CACGCCCCGC
TCGAATCCGG CCACCTACAC GGGACTGTTC AGCCATATCC GCGACCTGTT CGCCCGGCTT
CCCGAAGCCC AGCTCCGCGG CTACAAGCCC GGCCGGTTCT CCTTCAACGT AAAGGGCGGG
CGCTGCGAGA CCTGCAAGGG CGCCGGCGTG GTGAAGCTGG AGATGAACTT CCTGCCGGAC
GTGCACGTGA CCTGCGAGAC CTGCAGGGGC CGCCGTTACA ACGCCGAAAC GCTGGAAGTC
CGCTATCGGG GCAAGAGCAT CGCCGACGTG CTGGAGATGA CCGTGGACGA GGCGCTGGCG
TTCTTCGAAA ACATGCCGCG CATCGCCCGC AAGCTGCGCA CGCTGCAGGC CGTCGGGCTC
GGCTACATCC GGCTGGGCCA GCCCGCCACC ACGCTGTCGG GCGGCGAGGC CCAGCGTATC
AAGCTGGCCC GCGAGCTGTC ACGTCCCGGT ACCGGCAACA CGCTCTACAT CCTGGACGAA
CCCACCACCG GCTTGCACTT CGAGGACATC CGGCATCTGC TGGCCGTGCT CCGGGCGCTC
GTACGCAAAG GCAACACGGT GCTGGTGATC GAGCACAACC TCGACGTGAT CAAGGTGGCC
GACCATGTGA TCGAGCTGGG ACCCGAAGGC GGCGACGCCG GCGGCCACAT CCTGTTTGCG
GGCACGCCCG AAGAGCTGGC GGCTCAGGAC ACGCACACCG GCCGCTTCCT GCGCAAAGAG
CTGGCCCGCG CGGCCGCGCT CGCCGACGGC GCCGACGAGG AACTCATCGA CCTCGACCAG
CTCGTCACCG ACGAAGAAAC GCCCGAAGAA GAAATCGAAG ACGAACTGCT CGACGAAGAG
GAATAA
 
Protein sequence
MQDCIVIRGA REHNLKNIDL DIPRERLVVI TGLSGSGKSS LAFDTIYAEG QRRYLESLSA 
YARQFLGMLE RPDVDFIDGL SPVIAIEQKT VSQNPRSTVG TVTEVYDFLR LLYARVATAY
SHISGKPMRR QTDDEIINHI LSFPEGTRLL ILAPVVRGRK GHYRELFEQI ARQGFERVRV
DGELREITPG MKLDRYKTHD IEVVVDRIVV RPDIRPRVHD SVQIALGMGG GVLIAHVLNP
DGMGTDHVFS RHLYCPEDGV SYDDPSPNTF SFNSPYGACP ACNGLGTRLE IDPDLVIPDP
SKSINEGGLA PLGRPRDVWI FSQLRAVAAA YGFDFDTPLG ALTEEQRRVL LEGAGDREFE
IEYRFKDRTL RYRHRFGGIY EYLHHLREHA GSVSQRRWAE SFMREMTCQA CGGARLKPES
LAFRIGGKNI AELAHMDLVS LRRFLSELQF EGQKALIARP ILKEILERLG FLIEVGVGYL
TLDRPARTLS GGESQRIRLA AQLGSQLTGV LYVLDEPSIG LHPRDHHRLI DALKLLRDLG
NSILVVEHDR EMIEAADYVI DLGPGAGEHG GHVVAAGPPD RLEVRDGHVS LTAAYLKGAR
YIPVPRERRT GNGHRLVLYG ARGHNLKGDP LVLPLGTFIC VTGVSGSGKS SLINQTLYPI
LARHFHNARL VPLPYERIEG LEHLDKVIDI DQKPIGRTPR SNPATYTGLF SHIRDLFARL
PEAQLRGYKP GRFSFNVKGG RCETCKGAGV VKLEMNFLPD VHVTCETCRG RRYNAETLEV
RYRGKSIADV LEMTVDEALA FFENMPRIAR KLRTLQAVGL GYIRLGQPAT TLSGGEAQRI
KLARELSRPG TGNTLYILDE PTTGLHFEDI RHLLAVLRAL VRKGNTVLVI EHNLDVIKVA
DHVIELGPEG GDAGGHILFA GTPEELAAQD THTGRFLRKE LARAAALADG ADEELIDLDQ
LVTDEETPEE EIEDELLDEE E