Gene Rmar_1022 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRmar_1022 
Symbol 
ID8567662 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodothermus marinus DSM 4252 
KingdomBacteria 
Replicon accessionNC_013501 
Strand
Start bp1172669 
End bp1173904 
Gene Length1236 bp 
Protein Length411 aa 
Translation table11 
GC content65% 
IMG OID 
Productphage major capsid protein, HK97 family 
Protein accessionYP_003290304 
Protein GI268316585 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTGACCT TCGAACGTAT TCGCGAGCTT CGCGCTCAGC GCGAGCAGCT TGTGCGCGAA 
ATGGAGGCGA TGCTGCAGAA AGCCGAATCC GAAAAACGCG ACCTGACGGC CGAGGAGCGG
GCCGACTGGG ACAACTACCA GCAGCGTATT AACGAGCTCA CGAACGAAAT CAATGAACTG
GAGCGCCGGC TTGGCGGCTA CAATTTCGAT GCCTACAAGG CGGTTTATGA GCGGACCGAG
TCGGTCCGTG GCCTTGAGCT CTACCGCCGC GCCTGGCGCG CGTGGATTGC CGGTGATGAT
GATGCCTTAG GCCCTGACGA ACGCGAGGCC CTGCGCCAGG GCCTCGCGCG CCGCGCGCTC
GGCGTCGGCA CGCCGTCGGC CGGCGGCTAC CTGGTGCCGG AGACGATGCA GCGCCAGATC
GAGCGCCGGC TGGCCGAGAT TTCGCCGGTG ATCAATCTGG TGACGCGCAT TCGCACGGCC
AGCGGCGAGG ACCTGCTGAT TCCGACCGTT GACGACACGG CGAACAGCGC CACGATCGTG
TCCGAAAATA CGGCGCTTGC CGAGCAGGAC GTGTCGTTCG GGCAGGTCAG GCTCGGCGCC
TACACGTACT CGACGGGCAT CGTTCGCATC AGCCTCCAGC TGATGCAGGA CAGCGCCTTC
GACCTTGAGG CCTTCATGGC GGAGGTCTTT GCCGACCGTC TCGCTCGCGC GCTGCAGGAT
CACATCACGA ACGGTGACGG CACGACGCAG CCGGAGGGGA TCCTGACGGC GATTCCGTCC
GGCCAGATCG TACAGGGCGC CACAGGGCAG ACCACGTCGG TCACTTATGA TGACCTGGTG
GACCTGGTGC ACAAAGTCGA TCCGGCCTAC CGGTCGAGCC AGCGCGCGGC GTTCATGCTG
CACGACTCGA CGCTGGCCGC GCTGAAGAAG CTCAAGGACA ACCAGGGTCG GCCGATCTGG
CAGGAGGGCC TGCAGGCCGG TGAGCCAGCA CGGCTGCTCG GCTATCGAGT GATCATCAAC
AACGCGATGC CGCAAATGGC GGCTTCGGCG AAGTCGATCG TGTTCGGCGA CTTCAGCAAG
TACGTGCTGC GCGAGGTCGG CCCGGGCCTG GTCGTCAGGC GGCTGGATGA GCGCTACGCG
GAGTACCTGC AGTCGGCCGT TCTGGGCTTT GCGCGTTACG ATGGCCGAGT GCTCCAGCCG
TACGCGTTCG CTGCATACCA GAACTCGGCC TCCTAA
 
Protein sequence
MLTFERIREL RAQREQLVRE MEAMLQKAES EKRDLTAEER ADWDNYQQRI NELTNEINEL 
ERRLGGYNFD AYKAVYERTE SVRGLELYRR AWRAWIAGDD DALGPDEREA LRQGLARRAL
GVGTPSAGGY LVPETMQRQI ERRLAEISPV INLVTRIRTA SGEDLLIPTV DDTANSATIV
SENTALAEQD VSFGQVRLGA YTYSTGIVRI SLQLMQDSAF DLEAFMAEVF ADRLARALQD
HITNGDGTTQ PEGILTAIPS GQIVQGATGQ TTSVTYDDLV DLVHKVDPAY RSSQRAAFML
HDSTLAALKK LKDNQGRPIW QEGLQAGEPA RLLGYRVIIN NAMPQMAASA KSIVFGDFSK
YVLREVGPGL VVRRLDERYA EYLQSAVLGF ARYDGRVLQP YAFAAYQNSA S