Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpe_A0540 |
Symbol | |
ID | 4787363 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | - |
Start bp | 577080 |
End bp | 578246 |
Gene Length | 1167 bp |
Protein Length | 388 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 640089099 |
Product | putative major capsid protein |
Protein accession | YP_001019737 |
Protein GI | 124265733 |
COG category | [R] General function prediction only |
COG ID | [COG4653] Predicted phage phi-C31 gp36 major capsid-like protein |
TIGRFAM ID | [TIGR01554] phage major capsid protein, HK97 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.272419 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGACGA TTGCAGACAT CGCCGAAGAG CTCGACAAAA GCAACAAGTC CCTTGAGGCT TGGCGCGACA GGAAAGACAA GGAATTCGAG GAAGTGCGGC AGTTGGTTCA AGACTTCGCC AAGAAGTCGA ACCGCCCCGG TGCCGGCTTC GCGTCGCCGA TGCCTCTGTC GGACGCTCAG CGCAAGTCGC TCGATCTCGG CGCTCGTGCC CTGATCGCAG GCGACCAGGC CCGCGCTGAG AAGCACTTCG TCGAGGCCAA GGCCATGCAC ACCGGCAACG ACCCGGCCGG CGGCTACATG GTGCTGCCTC AGATGTCGGA CGAGGTCACG CGGGTGATGC TCGAAACCTC GCCGCTACTG AGTTCGGTTC GCCTCGTGGA TCTCGAGCCG ACCGACGAGT TCGAAGAGAT CGTCGACCGC GACGAGGTGG GCGTCACCTG GGTGGGCGAG CTAGACAGTC GCTCCGAGAC CACGGCGCCG GCCCTGGGTA AGTTCATCGT GCCGCTGCAT GAGGTGTACG CGATGCCGGC CGTGAGCCAG AAGCTAATCG ACACCGCGTC GTTCAACGTG ATGGACTGGC TCAACACCAA GATCGGCGAG AAGTTCGGCT TCTCGTTCGG CGCGTCGATC ATCGCTGGTG ACGGCGCCGC GAGACCGTCC GGGTTCACCA ACTACCCGAT CGCCGCCACG GCTGATGCCA CGCGCCCCTG GGGCACGCTG CAGTACCTGC CGACCGGCAC CGATGGCGCT TTCAACAGCA CCACGAAGAC GGACGTTCTG GTGGACACGG TGGCCGCTCT CAAGCCGCAG TACCGCACCG GCGCTTCCTG GGTGATGAAC CGCTCCACCG CGGCGACGCT GCGCAAGCTC AAGACGACCG ACGGCGAATC GCTCTGGGCC GCGTCGACCA CGGCTGGCCT GCCGTCCACG CTGCTGGGCT ACCCCGTGAT CGAAGACGAC CAGATGGCAT CGATCGCGAC GAACTCGCTG TCGATCGCCT TCGGCAACTT CCGGCGCGGC TACACGTTCG TTCGTCGTCT GGGCACACGC TTCCTGGTGG ACCCGTACAC CAACAAGCCC AAGGTGAACC TGTACGCCTA CCAGCGTGTC GGTGGCGCTG TCGCCAACTT CGAAGCGATC AAGCTGGTTC GCTTCTCTGC AGCCTGA
|
Protein sequence | MMTIADIAEE LDKSNKSLEA WRDRKDKEFE EVRQLVQDFA KKSNRPGAGF ASPMPLSDAQ RKSLDLGARA LIAGDQARAE KHFVEAKAMH TGNDPAGGYM VLPQMSDEVT RVMLETSPLL SSVRLVDLEP TDEFEEIVDR DEVGVTWVGE LDSRSETTAP ALGKFIVPLH EVYAMPAVSQ KLIDTASFNV MDWLNTKIGE KFGFSFGASI IAGDGAARPS GFTNYPIAAT ADATRPWGTL QYLPTGTDGA FNSTTKTDVL VDTVAALKPQ YRTGASWVMN RSTAATLRKL KTTDGESLWA ASTTAGLPST LLGYPVIEDD QMASIATNSL SIAFGNFRRG YTFVRRLGTR FLVDPYTNKP KVNLYAYQRV GGAVANFEAI KLVRFSAA
|
| |