Gene Information |
Locus tag | Mext_1431 |
Symbol | |
ID | 5833622 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | - |
Start bp | 1601926 |
End bp | 1603191 |
Gene Length | 1266 bp |
Protein Length | 421 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641367231 |
Product | HK97 family phage major capsid protein |
Protein accession | YP_001638903 |
Protein GI | 163850860 |
COG category | [R] General function prediction only |
COG ID | [COG4653] Predicted phage phi-C31 gp36 major capsid-like protein |
TIGRFAM ID | [TIGR01554] phage major capsid protein, HK97 family |
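
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; both assumptions are consistent with the values in this record but are not stated by it. The function names are illustrative only.

```python
# Values copied from the Gene Information table above.
START_BP = 1601926
END_BP = 1603191
GENE_LENGTH_BP = 1266
PROTEIN_LENGTH_AA = 421


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def coding_residues(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminating stop codon.
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_residues(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both checks agree with the record: the span covers 1266 bp, which corresponds to 422 codons, or 421 residues plus one stop codon.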
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 0.263231 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGAGA TGCGTTTCGA GACCAAGGCC CCCACCGGCC TGCCCGAGAA CAAGGCGGCG ACCTTCGGCA CCGATGCGGT GCTCGACGAG TTTGCCCGCG CCTTCGAGGC GTTCAAGGAG GCCAACGACG TCCGCCTCTC CGAGATCGAG ACCCGGCTCA CCGCGGATGT GGTGACCGAG GAGAAGCTCA TCCGCATCGA CGCCGCCCTC GATCAGGCGA AGAACCGCCT CGATCGGATC AGCCTCGACC GTGCCCGGCC GCCGCTCGGC GGGACGGAGC CGGCGCGCGA CGCCTCCGCC ACAGAGCACA AGGCGGCCTT CGACCTCTAT GTTCGAGCCG GCGAGAGCGC GGGTCTCAAG CGACTGGAAG AAAAGGCACT TTCCGCCGGC TCCGGGCCGG ATGGCGGCTA CCTCGTGCCG CCGACGATCG AGCGCGAGGT GCTGCGTCGG CTCGCCGAGA TCTCGCCGAT CCGCGCTATC GCCACGGTGC GGGCCGTCTC CGGCGGCCAG TACAAGCGCG CCGTCTCGGT CAACGGTCCC GCCGCGGGCT GGGTCGCCGA GACCGCGCCC CGTCCGCAGA CCGACACGCC GAACCTGTCC GAGCTGAGCT TCCCGGCCAT GGAACTCTAC GCCATGCCGG CGGCGACCCA GACGCTGCTC GACGACGCGG TGCTCGATAT CGATGCGTGG CTCGCCGAGG AAGTCGAGGC GGCCTTCGCC GAGCAGGAGA GTGTCGCCTT CGTCACGGGC AACGGCGTCG GTCGGCCGAA GGGCTTTCTC AGCTACGACA CCGTCGCCAA CGCGAACTGG GCTTCGGGCA GGCTCGGCTT CATCGCGACG GGGGCGGCCG GCGCCTTCCC CGCGAGCAAC CCGAGCGACG TGCTGTTCGA TCTGATCTAC GCGCTGCGCG CCGGCTACCG CCAGGGTGCG AGCTTCGTGA TGAATCGGCG GGTGCAGAGC GCGATCCGCA AGTTCAAGGA CGCCGACGGC AACTACCTCT GGCAGCCGCC GCTTGCCGCC GACCGGGCCG CGACGCTGAT GGGCTTTCCG CTGGTCGAAG CCGAGGCGAT GCCCGACATC GCCGCCGGCA GCCACGCCAT CGCCTTCGGC AACTTCAAGC GCGGCTACCT CGTCGTGGAC CGCGTCGGCC TTCGGACCCT GCGCGATCCC TACTCCGCCA AGCCCTACGT GCTGTTCTAC ACCACCAAGC GCGTCGGCGG CGGGGTGCAG GACTTCGCCG CGATCAAGCT GCTCCGGTTC GCCTGA
|
Protein sequence | MTEMRFETKA PTGLPENKAA TFGTDAVLDE FARAFEAFKE ANDVRLSEIE TRLTADVVTE EKLIRIDAAL DQAKNRLDRI SLDRARPPLG GTEPARDASA TEHKAAFDLY VRAGESAGLK RLEEKALSAG SGPDGGYLVP PTIEREVLRR LAEISPIRAI ATVRAVSGGQ YKRAVSVNGP AAGWVAETAP RPQTDTPNLS ELSFPAMELY AMPAATQTLL DDAVLDIDAW LAEEVEAAFA EQESVAFVTG NGVGRPKGFL SYDTVANANW ASGRLGFIAT GAAGAFPASN PSDVLFDLIY ALRAGYRQGA SFVMNRRVQS AIRKFKDADG NYLWQPPLAA DRAATLMGFP LVEAEAMPDI AAGSHAIAFG NFKRGYLVVD RVGLRTLRDP YSAKPYVLFY TTKRVGGGVQ DFAAIKLLRF A
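
The gene sequence above is printed in blocks of ten bases, begins with ATG, and ends with TGA, so it appears to be given in coding orientation even though the gene lies on the minus strand. A minimal sketch for working with a sequence pasted in this layout follows. The helper names are hypothetical, and the completeness check is a simplification: translation table 11 also permits alternative start codons such as GTG and TTG, which this check deliberately ignores.

```python
# Stop codons used by translation table 11 (bacterial, archaeal and plant plastid code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C, in the range 0.0 to 1.0."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def looks_like_complete_cds(seq: str) -> bool:
    """Rough check: whole codons, ATG start, and a table 11 stop codon at the end.

    Simplification: only ATG is accepted as a start codon here.
    """
    s = clean(seq)
    return (
        len(s) >= 6
        and len(s) % 3 == 0
        and s.startswith("ATG")
        and s[-3:] in STOP_CODONS
    )


if __name__ == "__main__":
    # First ten-base block of the gene sequence above, used as a small example.
    print(gc_fraction("ATGACCGAGA"))
    # Toy sequence for illustration, not taken from the record.
    print(looks_like_complete_cds("ATG GCC TGA"))
```

Running `gc_fraction` on the full gene sequence gives a value that can be compared with the GC content reported in the Gene Information table.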