Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rmar_1022 |
Symbol | |
ID | 8567662 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodothermus marinus DSM 4252 |
Kingdom | Bacteria |
Replicon accession | NC_013501 |
Strand | + |
Start bp | 1172669 |
End bp | 1173904 |
Gene Length | 1236 bp |
Protein Length | 411 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | |
Product | phage major capsid protein, HK97 family |
Protein accession | YP_003290304 |
Protein GI | 268316585 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTGACCT TCGAACGTAT TCGCGAGCTT CGCGCTCAGC GCGAGCAGCT TGTGCGCGAA ATGGAGGCGA TGCTGCAGAA AGCCGAATCC GAAAAACGCG ACCTGACGGC CGAGGAGCGG GCCGACTGGG ACAACTACCA GCAGCGTATT AACGAGCTCA CGAACGAAAT CAATGAACTG GAGCGCCGGC TTGGCGGCTA CAATTTCGAT GCCTACAAGG CGGTTTATGA GCGGACCGAG TCGGTCCGTG GCCTTGAGCT CTACCGCCGC GCCTGGCGCG CGTGGATTGC CGGTGATGAT GATGCCTTAG GCCCTGACGA ACGCGAGGCC CTGCGCCAGG GCCTCGCGCG CCGCGCGCTC GGCGTCGGCA CGCCGTCGGC CGGCGGCTAC CTGGTGCCGG AGACGATGCA GCGCCAGATC GAGCGCCGGC TGGCCGAGAT TTCGCCGGTG ATCAATCTGG TGACGCGCAT TCGCACGGCC AGCGGCGAGG ACCTGCTGAT TCCGACCGTT GACGACACGG CGAACAGCGC CACGATCGTG TCCGAAAATA CGGCGCTTGC CGAGCAGGAC GTGTCGTTCG GGCAGGTCAG GCTCGGCGCC TACACGTACT CGACGGGCAT CGTTCGCATC AGCCTCCAGC TGATGCAGGA CAGCGCCTTC GACCTTGAGG CCTTCATGGC GGAGGTCTTT GCCGACCGTC TCGCTCGCGC GCTGCAGGAT CACATCACGA ACGGTGACGG CACGACGCAG CCGGAGGGGA TCCTGACGGC GATTCCGTCC GGCCAGATCG TACAGGGCGC CACAGGGCAG ACCACGTCGG TCACTTATGA TGACCTGGTG GACCTGGTGC ACAAAGTCGA TCCGGCCTAC CGGTCGAGCC AGCGCGCGGC GTTCATGCTG CACGACTCGA CGCTGGCCGC GCTGAAGAAG CTCAAGGACA ACCAGGGTCG GCCGATCTGG CAGGAGGGCC TGCAGGCCGG TGAGCCAGCA CGGCTGCTCG GCTATCGAGT GATCATCAAC AACGCGATGC CGCAAATGGC GGCTTCGGCG AAGTCGATCG TGTTCGGCGA CTTCAGCAAG TACGTGCTGC GCGAGGTCGG CCCGGGCCTG GTCGTCAGGC GGCTGGATGA GCGCTACGCG GAGTACCTGC AGTCGGCCGT TCTGGGCTTT GCGCGTTACG ATGGCCGAGT GCTCCAGCCG TACGCGTTCG CTGCATACCA GAACTCGGCC TCCTAA
|
Protein sequence | MLTFERIREL RAQREQLVRE MEAMLQKAES EKRDLTAEER ADWDNYQQRI NELTNEINEL ERRLGGYNFD AYKAVYERTE SVRGLELYRR AWRAWIAGDD DALGPDEREA LRQGLARRAL GVGTPSAGGY LVPETMQRQI ERRLAEISPV INLVTRIRTA SGEDLLIPTV DDTANSATIV SENTALAEQD VSFGQVRLGA YTYSTGIVRI SLQLMQDSAF DLEAFMAEVF ADRLARALQD HITNGDGTTQ PEGILTAIPS GQIVQGATGQ TTSVTYDDLV DLVHKVDPAY RSSQRAAFML HDSTLAALKK LKDNQGRPIW QEGLQAGEPA RLLGYRVIIN NAMPQMAASA KSIVFGDFSK YVLREVGPGL VVRRLDERYA EYLQSAVLGF ARYDGRVLQP YAFAAYQNSA S
|
| |