Gene Rmar_0954 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRmar_0954 
Symbol 
ID8567593 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodothermus marinus DSM 4252 
KingdomBacteria 
Replicon accessionNC_013501 
Strand
Start bp1086717 
End bp1087877 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content62% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003290236 
Protein GI268316517 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGTACGG TTCAACACAC ACTGCGCACG TTTTGCGCAC TACTGGGACT GGGTTTGCTG 
CTGGGATGGA CGTACGTGGC CCGCGCACAG GTGGGCGGAG CCGCCGTGCT GTTTCTGCAG
ATCGAACCCG ACAGCCGGGC CGCCGGTATG GGCAATGCCG GCGTGGCCGT GGCGGACAAC
GCCTATGCGC TGTTCTGGAA CCCGGCGGGG CTGGCGTATC AGCCCGAGGC CGTCGAGGTA
TCGCTCACGC ACTCGAACTG GCTGCCGGAA TTCAACGCCG GCCTGTACTA CGAATACCTG
GTCGGGCGCT TCTCGGTAGG TAAGTTCGGC AACATGGGCG CCCATGTGAC GCTGCTCAAC
ATGGGTGAGC ACGAGTGGCG GGATGAAAAC AACAACCCGC TCGGCACTTT CCGCTCCTAC
GATGTGGCCG TGGGCGTCTC CTACGGCTAT CCGATCAGCG AGCGACTGGC GCTGGGCCTG
GGCCTCCGCT ACATCTACTC GAACCTGGCT TCGGGCATTC AGGTCGAAGG CCGCGAGACC
AAGGCCGGCA AGTCCTTCGG GATGGATCTG GGCCTGCTGT ACCGGACGGC TCCGTTCAGT
CTGGGCGGCC AGACGAAGGC GCAGTTCTCG GCCGGCTTCA ACCTGAACAA CATGGGGCCC
CAGATCCAGT ACTCCGACGG CGCCCAGAAG GACCCGATCC CGACGAACCT GCGCTTCGGC
TATGCCTTTA CGATCGATCT GGACCCCTAC AATCGGATCA CCTTCGCCAA CGACTTCACG
AAGCTGCTCA TTCGCGTGCG GAGCGACTCG ACCGGCTCGC GGGCCGATCC CTTCTACAAG
GCGATCTTCA CGGCCTGGCG GCCGATCAAG GTGCGCACGA ACGCCCTCAA CGAAGAGGAA
GCCCGGTACC GCACGCTGAG CGTCTTCGAG CAGCTCATGA TCGGGATGGG TGTGGAGTAC
TGGTACAACC AGCTCTTCGC GCTGCGGACG GGCTTTTTCT ACGAGAACCC CTACAACGGC
AACCGGCAAT TTTTAACCTT CGGTGCCGGG TTGCGCTACA ACATCCTGGG CGTGGACTTT
TCCTATGTGT ACGCGCTCAA GGAGAACCAT CCGCTGGCCA ACACGATGCG CTTTTCGCTG
TTGCTGAACT TCAAGAAGTA G
 
Protein sequence
MRTVQHTLRT FCALLGLGLL LGWTYVARAQ VGGAAVLFLQ IEPDSRAAGM GNAGVAVADN 
AYALFWNPAG LAYQPEAVEV SLTHSNWLPE FNAGLYYEYL VGRFSVGKFG NMGAHVTLLN
MGEHEWRDEN NNPLGTFRSY DVAVGVSYGY PISERLALGL GLRYIYSNLA SGIQVEGRET
KAGKSFGMDL GLLYRTAPFS LGGQTKAQFS AGFNLNNMGP QIQYSDGAQK DPIPTNLRFG
YAFTIDLDPY NRITFANDFT KLLIRVRSDS TGSRADPFYK AIFTAWRPIK VRTNALNEEE
ARYRTLSVFE QLMIGMGVEY WYNQLFALRT GFFYENPYNG NRQFLTFGAG LRYNILGVDF
SYVYALKENH PLANTMRFSL LLNFKK