Gene Information |
Locus tag | Mpe_B0104 |
Symbol | |
ID | 4787707 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008826 |
Strand | + |
Start bp | 96307 |
End bp | 97728 |
Gene Length | 1422 bp |
Protein Length | 473 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 640092513 |
Product | hypothetical protein |
Protein accession | YP_001023118 |
Protein GI | 124262648 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
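The coordinate and length fields in the Gene Information table are mutually consistent, and the check can be sketched in a few lines. The variable names below are illustrative and not part of the record. The sketch assumes that the start and end coordinates are inclusive and that the final three nucleotides are a stop codon that is not translated.

```python
# Values copied from the Gene Information table.
start_bp = 96307
end_bp = 97728

# Assumption: coordinates are 1-based and inclusive at both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and contributes no amino acid.
stop_codon_nt = 3
protein_length = (gene_length - stop_codon_nt) // 3

print(f"Gene length: {gene_length} bp")        # 1422 bp
print(f"Protein length: {protein_length} aa")  # 473 aa
```

Both results match the Gene Length and Protein Length rows of the table.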
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0194 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.00000595624 |
Fosmid hitchhiking | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAGCTCG AACACATCCC GGAACCACGA CTACGGTTCG CGTCCGGCGA ACACATCTGC CCGCGCAGGG GCATCGCAGC TTACGGCGTG TTCGATCGGA GCATGGACTC CCGCCGCACT GACGTCATGA TTGGCGGGGT AGGCACCGCT ACATGCATCG AAGCGCTCGG CCGGTGGGTT GAGCGCTGCA GTTCGGAGAT TCCTGCGCCG GAGACGGCGA AGCAACCGAA CCTGCGGGTG CCGTTCCCCG GGGTCGGCCG CGGCCACGCT TTCGATGCCA AGCTGGTCTT TGGCAGTGAC CTCGCTCGGA CACTGAAGAA AAGCGAGGTC GACGAAATCG TCGCCATTGG CGACCGAACC ACCAGGCTGT CGAAGGCTAT TGACCTTTAC TACGAGCACA TAAAGTTCCT TGCGCAGAAT CGGCAGATCG ACGCCGTGGT CTGCGTGATT CCCGATGCTC TCTACAAGGT GGTAGCGACG GAGGAGTCGA ATCCGCTCGA AGAGACACTT GATGCAAGTG TCGAGGTGGC GTCGGAGCTG AACTTCCGGC GCGCACTGAA GGCCAAGGCC ATGCACTTGG GCAAGCCGTT GCAGCTCATA CGAGCCTTCT CGCTTGAGAG CAACAAGAAG GGACAGCAGG ACGATGCCAC TAAGGCATGG AACTTCTGCA CGGCGCTCTA CTACAAGGCT GGGCCACGCG TTCCATGGAA GTTGTCAGCC GACGACAGGC GACCTTCATC TTGCGCGGTC GGGATTGCGT TCTATCGCAG TAGAGATCGA CAGGTGCTCA ACACCAGCTT GGCGCAGATC TTCGATGAGT TGGGCAACGG TCTGATCCTT CGTGGCACCC CGATCGACAT GACTCGGGAT GACCGAGTTC CCCACCTCAA TGCCCAGCAG GCCTACGACC TTCTAACTGC CGCACTCAAC GAATACAGAG TCGCGTTGCG CAACTTTCCA GCGAGGATCG TGGTCCATAA GTCGTCGAAC TTCTCAGCGG AAGAGATCGA CGGCCTCAGC GAGGCGGCCT CCGACCTGAG GATTGATACC GTTGATTTGG TCACCGTGAT GGACTCGAGG TTGCGTCTCT TTCGGGAGGG AAACTATCCT CCGTATCGCG GGACAAGGAT TGAGATGGAC GACCGCCGCC ACGTCCTGTA TTCCCGGGGC TCGGTTTGGT ACTACAAGAC CTACACCGGG CTCTACATCC CCGAGCCTAT TGAGTTGCGA ATCGTGCGGT CCGAGGAGTC TCCGTCGTTC ATCGCTCGCG AGATTCTGGG ACTGACCAAA ATGAACTGGA ACAACACGCA GTTCGATGGA AAGTACCCTG TTACGCTCGG ATGCGCGAGA AAGGTCGGCG AGATCATGAA GTACCTGAGC GACCGGGACG ATCCGCAGAT TCGCTACGGC TTCTACATGT GA
|
Protein sequence | MKLEHIPEPR LRFASGEHIC PRRGIAAYGV FDRSMDSRRT DVMIGGVGTA TCIEALGRWV ERCSSEIPAP ETAKQPNLRV PFPGVGRGHA FDAKLVFGSD LARTLKKSEV DEIVAIGDRT TRLSKAIDLY YEHIKFLAQN RQIDAVVCVI PDALYKVVAT EESNPLEETL DASVEVASEL NFRRALKAKA MHLGKPLQLI RAFSLESNKK GQQDDATKAW NFCTALYYKA GPRVPWKLSA DDRRPSSCAV GIAFYRSRDR QVLNTSLAQI FDELGNGLIL RGTPIDMTRD DRVPHLNAQQ AYDLLTAALN EYRVALRNFP ARIVVHKSSN FSAEEIDGLS EAASDLRIDT VDLVTVMDSR LRLFREGNYP PYRGTRIEMD DRRHVLYSRG SVWYYKTYTG LYIPEPIELR IVRSEESPSF IAREILGLTK MNWNNTQFDG KYPVTLGCAR KVGEIMKYLS DRDDPQIRYG FYM
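The sequences in this record are printed in space-separated blocks of ten characters, so the spaces must be removed before computing length or GC content. The following is a minimal sketch of that cleanup. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example uses only the first two blocks of the gene sequence rather than the full record.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Return the fraction of G and C bases in a (possibly block-formatted) sequence."""
    seq = clean_sequence(text)
    if not seq:
        return 0.0
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two ten-base blocks of the gene sequence shown above.
first_blocks = "ATGAAGCTCG AACACATCCC"

print(len(clean_sequence(first_blocks)))  # 20
print(gc_fraction(first_blocks))          # 0.5
```

Passing the full gene sequence to the same functions should give values comparable to the Gene Length and GC content rows, assuming the record reports GC content as a rounded percentage of the whole gene.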