Gene Information |
Locus tag | Mpe_A1961 |
Symbol | |
ID | 4784747 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | + |
Start bp | 2099599 |
End bp | 2100669 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 640090531 |
Product | hypothetical protein |
Protein accession | YP_001021154 |
Protein GI | 124267150 |
COG category | [S] Function unknown |
COG ID | [COG4394] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
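The coordinate and length fields above are mutually consistent, which a short sketch can confirm. It assumes the start and end coordinates are 1-based and inclusive (not stated in the record, but consistent with the listed gene length) and that the CDS ends in a single stop codon. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming exactly one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record for Mpe_A1961.
length_bp = gene_length(2099599, 2100669)
print(length_bp)                  # 1071
print(protein_length(length_bp))  # 356
```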
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.12391 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGACACAGC CGCTGCGATG GGACCTGTTC TGCCACGTGA TCGACAACTT CGGCGATGTC GGCGTGTCGT GGCGGCTCGC TGCCGACCTG GCGCGGCGCG GTCAGCAGGT CCGGCTGTGG ATCGACGACC CGTCGGCACT CGCCTGGATG GCCCCGGCCG GGCAACCCGG CGTCGAGGTG CTGCGCTGGG CCGAGCCGCT GCCGGACCGC GAGCCGGGCG ACGTGGTCGT CGAGACCTTC GGGTGCCAGC TCCCTGCGAC CTTCGTGCAG CGCATGCGGC GCCCCTGCCC GCCGGTGTGG ATCAACTTCG AGTACCTGAG CGCCGAGTCC TACGTGGAGC GCAGCCATGG GCTGCCCTCG CCGCAGCTCG AAGGTCCCGG CCAGGGGCTG AGCAAGTGGT TCCTCTACCC CGGCTTCACG CCGCGCACCG CCGGCCTGAT CCGCGAACCT GAGTTGCTGC CGCGCCGTGC GGCCTTCGAC GCCACGGCCT GGCTGGCATC CCACGGCGTG CAGCGCCAGC AGGGTGAACG CGTGGTCAGC CTGTTCTGCT ACGAAAACGC GGCCTTGCCA ACCTGGCTGG ACAGCCTGGC GGAGGTGCCT ACGGTCCTCC TGGTGACGCC GGACCGGGCG GCACGGCAGG TGCGGTCCGC GCTCGGCGAC GGCGGTCGGA CGGGTGCGCT GCGAACCGTG ATGTTGCCCT ATCTGCCACA GGACGAGTTC GATCACCTGC TGTGGGCCAG CGACCTCAAC TTCGTGCGCG GAGAGGATTC GTTCTCGCGC GCGCAATGGG CGGGCGTGCC CTTCATCTGG CAGATCTACC CGCAGGTGGA CGACTTCCAT GCCGTCAAGC TGGACGCCTT CCTGGACCGT TACCTCGACG CGGCCGCACC CGCCCAGGGT GTGCAGATCC GCGCGCTATG GCACGGGTGG AACGGTTTGT CCGGCGTTAC CCGGCCGACC TGGCCCGAAG GGCAGGATTG GCAGCGCCTT GCCCGCGACT GGAGCGCGCA CCTGGCCGCC CTGCCCGACG CCACCGACTG TCTGCTTCGC TTCGCGGCGG ACAGAGGCTA A
Protein sequence | MTQPLRWDLF CHVIDNFGDV GVSWRLAADL ARRGQQVRLW IDDPSALAWM APAGQPGVEV LRWAEPLPDR EPGDVVVETF GCQLPATFVQ RMRRPCPPVW INFEYLSAES YVERSHGLPS PQLEGPGQGL SKWFLYPGFT PRTAGLIREP ELLPRRAAFD ATAWLASHGV QRQQGERVVS LFCYENAALP TWLDSLAEVP TVLLVTPDRA ARQVRSALGD GGRTGALRTV MLPYLPQDEF DHLLWASDLN FVRGEDSFSR AQWAGVPFIW QIYPQVDDFH AVKLDAFLDR YLDAAAPAQG VQIRALWHGW NGLSGVTRPT WPEGQDWQRL ARDWSAHLAA LPDATDCLLR FAADRG
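The sequence fields are printed in space-separated blocks of ten characters. A minimal sketch of how such a field could be cleaned, translated, and checked for GC content follows. It relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard code (the two tables differ in their permitted start codons). The helper names are hypothetical, and only the first 21 bases of the gene are used for brevity.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace that separates the 10-character blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate complete codons, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 21 bases of the gene sequence, as printed in the record.
prefix = clean("ATGACACAGC CGCTGCGATG G")
print(translate(prefix))  # MTQPLRW
```

The translated prefix matches the first seven residues of the protein sequence in the record. Passing the full cleaned gene sequence to `gc_fraction` gives a value that can be compared against the GC content listed above.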