Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pmen_3970 |
Symbol | |
ID | 5110193 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pseudomonas mendocina ymp |
Kingdom | Bacteria |
Replicon accession | NC_009439 |
Strand | - |
Start bp | 4349925 |
End bp | 4351166 |
Gene Length | 1242 bp |
Protein Length | 413 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 640505233 |
Product | HK97 family phage major capsid protein |
Protein accession | YP_001189449 |
Protein GI | 146308984 |
COG category | [R] General function prediction only |
COG ID | [COG4653] Predicted phage phi-C31 gp36 major capsid-like protein |
TIGRFAM ID | [TIGR01554] phage major capsid protein, HK97 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 40 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 36 |
Fosmid unclonability p-value | 0.33538 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCATTC AAGCCAAGCG GGAGCAACGC AGCGCCCTGG CGAAAGAAAC CCGCGCCCTG ATGGACGCGA ACACTGGCGA CAATTGGGGC GCTGAACAGC AGGCCAAATA CGATGGCCTG GTGGCGCAGA TCGACCGCCT GGACGGCGAG ATCGAGCGCG CCCAGAAGCT GCTGGACATT GAGGCCAAGC AGAAGCACAG CACCCGCAGT CGTGCCGAGC GCGATGGCAT CAGCGACGAT GAGGCCGAAA GCCGCATTCT CGATGACAAA GCCATCTTCC GCGCCTGGGT GGCCGGCGGC ATCGACAACC TTACTGCCGA GCAGCGCGAG ATCGTGGCCA AGCGCCGCGA GGAAGTGCGC AACACCATGA GCACCACCAC CGGCAGCGAA GGTGGCTATC TGGTGCCGCG CGAGTTCTCG GCCAATCTGC TGGAGGCCTT GAAGGACTTC GGCGGCATGC GCGCCGTGGC GCAGGTCATC CGCACCGAAA CCGGCGCCGC CATGGACTGG CCGACCACCG ATGCCACTTC GGAAGAGGGC GAGATCGTTG GTGAAAACGC CGAGGTTGAC AGCCAGGACG CCACCTTCGG CACCCTGGCA CACGTGGTCT ACAAGTTCAG CTCCAAGGAC ATTGCTGTGC CCTTCGAGCT GCTGCAAGAC AGCGCCATCG ACCTCGAAGC GCACATCAAC CAGCGCCTGA CCGAGCGCCT GGGTCGTATC ACCAATCGTA TGTTCACCAC CGGCACCGGC ACCAACCAGC CGCACGGCGT GGTGACTGGC GCGGCTGCCG GCAAGGTGGG CGCCACTGGT AAAGCTACCT CGGTCGGCTG GGAGGATCTG GTTGACCTCG AACACAGCGT TGACCCGGCC TACCGCCGTT CCGGCAACTG CTCGCTGATG TTCCACGACA CCACCTTGCG CGAGCTGAAG AAACTGAAAG ATGCCGACGG CCGCCCGATC TGGCTGCCAG GTGTGGATGT GGCCGAGCCG GCTACCCTGA TCGGCATGCG TTACACCATC AATCAGGACA TGCCGGTGAT GGCCGCCAAT GCCAAGTCGA TCCTGTTCGG TGACTTCAGC CGCTACATCA TCCGCGACGT GATGCAGGTG CTGCTGTTCC GCATGACCGA CTCGGCCTAC ACCCGCAAGG GCCAGGTTGG CTTCCTGGCG TTCATGCGCT CGGGCGGTCG CCTGATGGAT GTGGGCGGCG CGCTGAAGTA CTACCAGAAC TCGGCGACCT GA
|
Protein sequence | MSIQAKREQR SALAKETRAL MDANTGDNWG AEQQAKYDGL VAQIDRLDGE IERAQKLLDI EAKQKHSTRS RAERDGISDD EAESRILDDK AIFRAWVAGG IDNLTAEQRE IVAKRREEVR NTMSTTTGSE GGYLVPREFS ANLLEALKDF GGMRAVAQVI RTETGAAMDW PTTDATSEEG EIVGENAEVD SQDATFGTLA HVVYKFSSKD IAVPFELLQD SAIDLEAHIN QRLTERLGRI TNRMFTTGTG TNQPHGVVTG AAAGKVGATG KATSVGWEDL VDLEHSVDPA YRRSGNCSLM FHDTTLRELK KLKDADGRPI WLPGVDVAEP ATLIGMRYTI NQDMPVMAAN AKSILFGDFS RYIIRDVMQV LLFRMTDSAY TRKGQVGFLA FMRSGGRLMD VGGALKYYQN SAT
|
| |