Gene Information |
Locus tag | Mpe_A0944 |
Symbol | |
ID | 4787327 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | - |
Start bp | 999425 |
End bp | 1000435 |
Gene Length | 1011 bp |
Protein Length | 336 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 640089506 |
Product | hypothetical protein |
Protein accession | YP_001020141 |
Protein GI | 124266137 |
COG category | [R] General function prediction only |
COG ID | [COG4447] Uncharacterized protein related to plant photosystem II stability/assembly factor |
TIGRFAM ID | |
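The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive, and that the gene length counts the stop codon while the protein length does not. The function names are illustrative and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(999425, 1000435)
print(length)                  # 1011, matching "Gene Length"
print(protein_length(length))  # 336, matching "Protein Length"
```

Under these assumptions the record is internally consistent: 1011 bp is 337 codons, of which the last is the stop codon.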
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTTGCCC AGTTTGAGAA ATTCGGAGGT GGTGCGCGCG GCGCGGTCAA CCTGCTCACC TCGGCCCTGC CGATCGCCAT CATCGGCGGC CTGCTCTACG CCGGCTTCTT CGTGAAGGCC GAGGCGGTCA TCAAGAAGGT CGAGCCGAAG GCGGTCGAAC GCCGCGACAA CTTCTTCAGC ATCGCCACGC CGAACGACCA GGTGGCCTGG GCCGCAGGCA GCGGCGGCAA GATCGTCCAC ACGGTCGATG GCGGCAAGAC CTGGCAGCGG CAGTCGACCG CGACGCTGGA GAACCTGCAG GGCATCGCCG CGTGGGACGC GATGCACGCT GTGGCGGTGG GCAACAACGG CGTGATCCTC GTCACCACCA ACGGCGGCAA TCTCTGGACG GCGGCCACGC TGCCGAGCTC CGGCAACCCG AACAAGCTGT TCCGCGTGCG CATCTTCGAC GGCGTGGCCT GGGCGGTCGG CGAGTTCGGC GCGCTGCTGC GCTCCGACGA CAAGGGCCAG ACCTGGACGC GCGCGCTGCC CGAGAAGGAC CGCGCCTGGA ATGCCGTGAG CTTCATCGGT CAGACCGGCT GGCTGGTCGG CGAGTTCGGC GCGGTGATGC GCAGCACCGA CGGCGGCGCC AACTGGACCG ACATCGAGAC CAAGAACAAG GTCAGCCTGA TGGCGGTGAG CTTCCGTGAC CCGCAGCACG GCGTGGCCGT GGGCCTCGCG GGCACGCTGG TCGTCACGAA CGACGGCGGG CTCACCTGGA GCGACGTCGA ACGCCCGACG CGCGAGCACC TGCTCGACGT CATCTGGGAC GAGAACCGCT GGACCGCGGT CGGCGACAAG GGGGTCATGG TGAGCTCCGA TGCCACGGCG CAGACCTGGA AAGCCCGCCG CATCTCGGAC GGCGACGTCT CGTGGCGCAC CCAGATCGCG AAGTCCGGCC CGCGCTACTA CCTGGCCGGC GCCAACCTCG CCGTGCTCGA AGGCGACCAG CTGACCGTCG CCGGTCGCTG A
|
Protein sequence | MLAQFEKFGG GARGAVNLLT SALPIAIIGG LLYAGFFVKA EAVIKKVEPK AVERRDNFFS IATPNDQVAW AAGSGGKIVH TVDGGKTWQR QSTATLENLQ GIAAWDAMHA VAVGNNGVIL VTTNGGNLWT AATLPSSGNP NKLFRVRIFD GVAWAVGEFG ALLRSDDKGQ TWTRALPEKD RAWNAVSFIG QTGWLVGEFG AVMRSTDGGA NWTDIETKNK VSLMAVSFRD PQHGVAVGLA GTLVVTNDGG LTWSDVERPT REHLLDVIWD ENRWTAVGDK GVMVSSDATA QTWKARRISD GDVSWRTQIA KSGPRYYLAG ANLAVLEGDQ LTVAGR
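The sequences above are printed in space-separated blocks of ten characters, so they need light cleaning before any analysis. The sketch below shows one way to strip the spacing, compute a GC percentage, and run a basic sanity check on a coding sequence. It assumes the gene sequence is given in the coding orientation and that the stop codons of translation table 11 are TAA, TAG and TGA. All helper names are hypothetical.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons assumed for translation table 11


def clean_sequence(blocked: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def ends_like_cds(seq: str) -> bool:
    """True if the length is a multiple of 3 and the last codon is a stop codon."""
    return len(seq) >= 3 and len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


# Usage with the first three blocks of the gene sequence shown above
fragment = clean_sequence("ATGCTTGCCC AGTTTGAGAA ATTCGGAGGT")
print(len(fragment))
print(round(gc_percent(fragment), 1))
```

Applied to the full gene sequence, `gc_percent` can be compared against the "GC content" field, and `ends_like_cds` against the "Type" and "Gene Length" fields.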