Gene Information |
Locus tag | Mpop_2686 |
Symbol | |
ID | 6311223 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium populi BJ001 |
Kingdom | Bacteria |
Replicon accession | NC_010725 |
Strand | - |
Start bp | 2909697 |
End bp | 2911097 |
Gene Length | 1401 bp |
Protein Length | 466 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 642651408 |
Product | phage major capsid protein, HK97 family |
Protein accession | YP_001925376 |
Protein GI | 188581931 |
COG category | [R] General function prediction only |
COG ID | [COG4653] Predicted phage phi-C31 gp36 major capsid-like protein |
TIGRFAM ID | |
| |
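The coordinates and lengths in the record above can be cross-checked with a few lines of arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS includes its stop codon; the function names `gene_length_bp` and `protein_length_aa` are hypothetical and chosen only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = cds_length_bp // 3
    # The stop codon (assumed to be included in the CDS) encodes no amino acid.
    return codons - 1 if has_stop_codon else codons


# Values taken from the record: Start bp 2909697, End bp 2911097.
length = gene_length_bp(2909697, 2911097)
print(length)                     # 1401
print(protein_length_aa(length))  # 466
```

Under these assumptions the result agrees with the listed Gene Length (1401 bp) and Protein Length (466 aa).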
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGCGCT CCGAGCGCCT GAAAGCGAAG TACCTCGAAC TCGTCGCCAA GTCGAAGGGG ATGATCGAGG CCGCGGAGAA GCGCGAGAAC AAGGACTTCA CCGACCAGGA ATCGAAGGAC TTTGACGCGC TGACGGAGGA GATGGACGGC ACCTTCAAGG CCTATGAGCA GGCGCTCAAG GCGGAGAAGG CTCAGGCCGG CGAGGAGTCC GTGCCGACGA ACGAGACCAA CGGCAACGCC AATCAGGCAG CCGGCGCGGG CGAGCTCGCT GAGGGCGAGC GCGAGAAGCT TTACGCTCGC ACCGAGACCA AGATGGAGAC CGGCCAGAAG ATCTCGCTCA TCGCCGCCTC GGTGGTGAAG GCGAAGCTGT CCGGCGGCGA GAAGAACGCG TTCCAGGTGC TCAGCGACGA GGGCTTCCCT CAGTTTGCTC TCGACATCCA GCGCGGCAGC CGGAACAAGT CCTCGAACAC CCTGACCCCG GCCGCGGGCG GTGTGCTGCT GCCGACGCCG CTGGCGGCGG AGGTCGTGCC GTTCCTGCGT CCCGAGACCA CGTTCCTTCA GCTCAACCCC GTGCGCGTGC CGCTCACCGC CGGCCAGTAC AATCAGCCGG TCGGTGCAAC CGGCGCCGTG GCGCAGTACG TCGGCGAGGG CCAGAAGAAG CCCGTCACCG ACGTGACCTT CGACAAGCTC GGTCTGAAGG CGAAGAAGCT GGCCGCGATC ATCCTCCTGA CGAAGGAGGC CAAGAAGTGG ACCATCATCG ACATCCAGGC CTACATCGAG CGCGAGTTGC GCAACGCAGG CGGCCAGACC CTCGACCTCA ACGGCTGGCT CGGTACCGGT GCGAACGCCG ACACCCCGAC CGGCATCCTG AACGTATCGG GCGTCGGCGT CGTCACGCAC ACCTTCGCCG ATCCGAAGGC GCCGACCCTG AAGGAACTCG ATGCAGCAGC GTCGAAGCTG ATCCTCTACA TGACCCTCCG GTTCATCCCG GAGACGTCCC GTTGGGCGTG GGTCATGAAC CCGCGGACCC TGCGCTATCT CGCGGACATG CGCGTCGGCG CCGGCACCGA TGGCGAATAC GCCTTCCCTG AACTGCAGGG CGAGAACCCG CGGTGGAAGG GCAAGCGCGT GCTGGTCTCG ACGCAGATCC CGGCGAACCT CGGCACCGGT CTCGACGAGT CGATCCTCGC CCTGGTCAAC GCCGACGACG TGATCTTCGG CGAGGAAGAG GATGTCAGCC TCGACTTCTC CATGGAGGCG ACGATCGACG TCGGCGGCAC CCTCGTCCAC CTGTTCCAGC AGAACATGTG GGGCGTGCTC ATGGAGATGG CCCACGACTT CGGCCTCCGC CGCAAGGCCT CGGTGGTCCG CCTCAACGGC GTGCGCTGGG GCGCCCCGTA G
|
Protein sequence | MKRSERLKAK YLELVAKSKG MIEAAEKREN KDFTDQESKD FDALTEEMDG TFKAYEQALK AEKAQAGEES VPTNETNGNA NQAAGAGELA EGEREKLYAR TETKMETGQK ISLIAASVVK AKLSGGEKNA FQVLSDEGFP QFALDIQRGS RNKSSNTLTP AAGGVLLPTP LAAEVVPFLR PETTFLQLNP VRVPLTAGQY NQPVGATGAV AQYVGEGQKK PVTDVTFDKL GLKAKKLAAI ILLTKEAKKW TIIDIQAYIE RELRNAGGQT LDLNGWLGTG ANADTPTGIL NVSGVGVVTH TFADPKAPTL KELDAAASKL ILYMTLRFIP ETSRWAWVMN PRTLRYLADM RVGAGTDGEY AFPELQGENP RWKGKRVLVS TQIPANLGTG LDESILALVN ADDVIFGEEE DVSLDFSMEA TIDVGGTLVH LFQQNMWGVL MEMAHDFGLR RKASVVRLNG VRWGAP
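The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before any computation. The sketch below is illustrative only: it removes the block spacing, computes the GC fraction, and translates a CDS. It assumes the gene sequence is given in coding orientation (5' to 3') and uses the standard codon-to-amino-acid assignments, which translation table 11 shares with table 1 (the two differ in permitted start codons, which this sketch does not model). The helper names `clean_sequence`, `gc_fraction`, and `translate` are hypothetical, and only the first 30 bases of the gene are used as input.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(grouped: str) -> str:
    """Remove the whitespace between 10-character blocks and uppercase."""
    return "".join(grouped.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(cds: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
fragment = clean_sequence("ATGAAGCGCT CCGAGCGCCT GAAAGCGAAG")
print(len(fragment))          # 30
print(gc_fraction(fragment))  # 0.6
print(translate(fragment))    # MKRSERLKAK
```

The translated fragment matches the first block of the listed protein sequence. The 0.6 figure applies only to this 30-base fragment, not to the whole gene.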