Gene Information |
Locus tag | Mext_1433 |
Symbol | |
ID | 5833624 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | - |
Start bp | 1603754 |
End bp | 1604926 |
Gene Length | 1173 bp |
Protein Length | 390 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641367233 |
Product | HK97 family phage portal protein |
Protein accession | YP_001638905 |
Protein GI | 163850862 |
COG category | [S] Function unknown |
COG ID | [COG4695] Phage-related protein |
TIGRFAM ID | [TIGR01537] phage portal protein, HK97 family |
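The length fields above follow from the coordinates. A minimal Python sketch of that arithmetic, assuming the Start bp and End bp values are 1-based and inclusive (the record does not state the convention, but the numbers agree with it) and that the CDS ends in a single stop codon; the function names are illustrative and not part of the database:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of an inclusive, 1-based interval on the replicon."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS: codons minus one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(1603754, 1604926)
print(length_bp)                  # 1173
print(protein_length(length_bp))  # 390
```

Both results match the Gene Length and Protein Length fields of this record.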
| |
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 0.273066 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCTGGAT TTATCGCGCG GCTCGCGAGG GCGGCCGGGT TCGTCCCTGA GACGAAAGCG AGCGCGGCTT TTGCGCTCTA CGGCGAGGGA CGAGCGATCT GGACCGCGCG CGATTGCGCG GCTTTGGCCC GCGAGGGTTT CCAGCGCAAT GCCGTCGTCC ACCGCTCGGT TCGGCTCATC GCCGAGGCCG CGGCCTCCCT GCCGCTGACG CTGGCCCGCG CCGACGATGC CCATCCGCTG CTGGACTTGC TCGCCCGGCC GAATCCGCGC GAAGGCGGGA TGCGCTTCCT CGACGGGATC TATGGGCACC TGCTCGTCTC CGGCAATGCA TACATCGAAG CGGTCGAGAT CGATGGTCGA CCTCGTGAAC TGTTCTCCCT GCGTCCCGAC CGGATGCAGG TCGTGGCCGG CGCCGACGGC TGGCCCGCGG CTTACGAGTA CGCCGTCGGG GGGCGCCGAC TCCGCTACCA GCAGACCGGC GCCGTGCCGC CGATCCTGCA CCTGACGCTG TTCAACCCGC TCGACGACCA TTACGGCCTT TCGCCGATGG AGGCGGCGGC GGTCCCGCTC GACATCCACA ACGCGGCCGG CGCCTGGAAC AAGGCCCTGC TCGACAACGC CGCCCGCCCG TCCGGCGCTC TGGTCTTCGC GCCCTCGACC GGCGCCGCCT TGAGCGACAC GCAGTTCACG CGGCTCAAGG CCGAGCTGGA AACGAGCTAC CAGGGCAGCG CCAATGCCGG CCGGCCGCTC CTCCTCGATG GCGGGCTCGA TTGGCGCCCG CTCTCGCTCT CACCGAAGGA GATGGACTTC GTCGAGGCGA AGGCCGCCGC TGCCAGGGAG ATCGCGCTCG CCTTCGGCGT GCCGCCGCTC TTGCTCGGTC TTCCCGGCGA CAACACCCAC GCGAATTACG CCGAAGCCAA CCGTGCCTTC TACCGTCAGA CGGTGATCCC GCTGGTGCGC CGCACTGCCG ATTCCCTGGC GCGCTGGCTG GAGCCCGCCT TCGGCCCCGC GCGGTTGGAG CCGGATCTCG ACGCGGTCGA AGCGCTGGCG ACCGAGCGCG AGTCGCTCTG GCGCCGGGTG CAAGGCGCGG ACTTCCTGTC GGTCGCCGAG AAGCGCGAGG CCGTCGGCTA CCCTCCCCAG AGTCCGGGGC AAGGCACGGG CTCTCCGGCC TGA
|
Protein sequence | MPGFIARLAR AAGFVPETKA SAAFALYGEG RAIWTARDCA ALAREGFQRN AVVHRSVRLI AEAAASLPLT LARADDAHPL LDLLARPNPR EGGMRFLDGI YGHLLVSGNA YIEAVEIDGR PRELFSLRPD RMQVVAGADG WPAAYEYAVG GRRLRYQQTG AVPPILHLTL FNPLDDHYGL SPMEAAAVPL DIHNAAGAWN KALLDNAARP SGALVFAPST GAALSDTQFT RLKAELETSY QGSANAGRPL LLDGGLDWRP LSLSPKEMDF VEAKAAAARE IALAFGVPPL LLGLPGDNTH ANYAEANRAF YRQTVIPLVR RTADSLARWL EPAFGPARLE PDLDAVEALA TERESLWRRV QGADFLSVAE KREAVGYPPQ SPGQGTGSPA
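The gene sequence is listed 5' to 3' on the coding strand (it begins with ATG even though the gene lies on the minus strand), so it can be translated directly. The sketch below is one way to check the protein sequence and GC content against the gene sequence, under stated assumptions: it uses only the Python standard library, it builds the codon table from the standard genetic code (translation table 11 assigns the same amino acids and differs only in its permitted start codons), and the helper names `clean`, `gc_fraction`, and `translate` are illustrative.

```python
from itertools import product

# Standard genetic code in NCBI ordering (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-letter blocks and normalise case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence from this record.
prefix = "ATGCCTGGAT TTATCGCGCG GCTCGCGAGG"
print(translate(prefix))          # MPGFIARLAR
print(gc_fraction("ATGCCTGGAT"))  # 0.5
```

Passing the full gene sequence to `translate` and `gc_fraction` gives values that can be compared with the Protein sequence and GC content fields.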