Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mmar10_0922 |
Symbol | |
ID | 4285254 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Maricaulis maris MCS10 |
Kingdom | Bacteria |
Replicon accession | NC_008347 |
Strand | + |
Start bp | 1020163 |
End bp | 1021374 |
Gene Length | 1212 bp |
Protein Length | 403 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 638140390 |
Product | HK97 family phage major capsid protein |
Protein accession | YP_756153 |
Protein GI | 114569473 |
COG category | [R] General function prediction only |
COG ID | [COG4653] Predicted phage phi-C31 gp36 major capsid-like protein |
TIGRFAM ID | [TIGR01554] phage major capsid protein, HK97 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 37 |
Fosmid unclonability p-value | 0.255185 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGCAAGG AAACCAAGAT GACCGCCGCG ACCGGTGACA GCCGCGCGGT GATGGGCGAG CTGCTGGCTG CCTTCGAACA GTTCAAACAG GCCAATGACC AGCGCCTCGC CGAGATCGAG ACCCGCGCCG CTGCCGATGT GCTGCTGGAG GACAAGGTCG CCCGCATCGA CAAGGCGCTC GACAGCCAGA AATCCGCGCT CGACCGTCTC GTCCATGAAG CGGCCCGTCC CGGCCTCGCT CCGGGTGCCG ACAGGTCGGC GGCCTCGACG GGCTTTGCCG CCTATATGCG CAGCGGCCAG CTGGCCGAAG GCAAGTCCGC CACGGCCGGC ACGCCCGGTG AGGGCGGTCA TGTCGTGCCG GCCGAAACCG AGGCCCGGAT CGACCGGCTG CTGGCCGAAG CCTCACCCAT TCGCGCCATC GCCACGGTGC GCCAGACCGC GTCCGGCACC TTCCGCAAGC CGGTCTCGCG CGGCGGGGCG GCGACGGGCT GGGTGTCCGA AACGGCGGCC CGCCCGGAAA CCGATGCGCC CAGTCTCGAA CTGATCGAGT TTCCCGCCGC CGAGCTTTAC GCCATGCCGG CCGCCACCCA GCAATTGCTC GATGACGCCA TGGTCGATGT CGAGGACTGG CTGGCCGAGG AGGTCCGCGA CGTCTTCGCC GCCCAGGAAA GCGCCGCCTT TGTCTCCGGC GACGGCATCA ACAAGCCGCG TGGCCTGCTG GACTACACTG CCGTCGCCGA GGGCACCCAA GCCTGGGGCG AGCTCGGCTA TGTCGCCACC GGCACGGCGG GCGGGTTTGA CGCGACAGAT CCCGCCGACG CGTTGATCGA TCTCATCTAT GCACCCAAGA CCGCCTATCG CGCCAAGGGC CGCTTCCTGA TGAACCGCCA GACCGTCTCC GCCGTGCGCC GCTTCAAGGA TGCCGACGGC AATTATCTCT GGCAGCCGGC GCTGGGCGAG GGGGCGAGTT CGACCCTGCT CGGCTATCCG GTCACCGAGG CCGAGGACAT GCCCGATATC GGCACTGACA GCGCCTCGAT CGCTTTTGGT GACTTCGCCC GCGGCTATCT GGTGCTGGAC CGCCAGGGCG TCGAAGTCCT GCGCGACCCG TTCAGCGCCA AACCCTATGT CCTCTTCTAC ACCACCAAGC GCGTCGGCGG CGGAGTGCAG GATTTCGAAG CGATCAAGCT GCTGAAGTTT GGTGTGAGCT GA
|
Protein sequence | MSKETKMTAA TGDSRAVMGE LLAAFEQFKQ ANDQRLAEIE TRAAADVLLE DKVARIDKAL DSQKSALDRL VHEAARPGLA PGADRSAAST GFAAYMRSGQ LAEGKSATAG TPGEGGHVVP AETEARIDRL LAEASPIRAI ATVRQTASGT FRKPVSRGGA ATGWVSETAA RPETDAPSLE LIEFPAAELY AMPAATQQLL DDAMVDVEDW LAEEVRDVFA AQESAAFVSG DGINKPRGLL DYTAVAEGTQ AWGELGYVAT GTAGGFDATD PADALIDLIY APKTAYRAKG RFLMNRQTVS AVRRFKDADG NYLWQPALGE GASSTLLGYP VTEAEDMPDI GTDSASIAFG DFARGYLVLD RQGVEVLRDP FSAKPYVLFY TTKRVGGGVQ DFEAIKLLKF GVS
|
| |