Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpe_A1940 |
Symbol | |
ID | 4786701 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | - |
Start bp | 2076133 |
End bp | 2077260 |
Gene Length | 1128 bp |
Protein Length | 375 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640090510 |
Product | putative secreted substrate binding protein |
Protein accession | YP_001021133 |
Protein GI | 124267129 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.15659 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0532415 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAACTCA AACTCAAAGC CATCGCCCTT GCAACGGCCC TGCTCGCGAC CGGCGCCGTC TCGGCCCAGG AAGTGATCAA GATCGGCCAC GTCGCCCCCA TCTCCGGTGC CCAGGCCCAC TACGGCAAGG ACAACGAGAA CGGCGCCCGG ATGGCGATCG AGGAACTCAA CACCCAGAAC ATCACGATCG GTGGCAAGAA GGTCAAGTTC GAACTGGTTG CGGAAGACGA CGCTGCCGAC CCGAAGCAGG GCACGGCCGC CGCCACCAAG CTGTGCGATG CCAAGGTCAA CGGTGTGGTC GGTCACCTGA ACTCCGGCAC CACCATTCCC GCCTCGAAGA TCTACAACGA CTGCGGCATT CCTGAGATCA CCCCGTCGGC CACGAACCCC AAGTACACGC AGCAGGGCTT CAAGACCGCT TTCCGCATCC TGGCCAACGA CAACGCGCTC GGCGCCGGCC TGGCTTTGCA CGCCGCCAAC AACCTGAAGC TCAAGAAGGT CGCGATCATC GATGACCGCA CTGCCTACGG GCAGGGTGTG GCCGAGGTGT TCAAGAAGAC TGCCCAGGCC AAGGGCATCC AGATCGTCGA TGAGCAGTAC ACCACCGACA AGGCCACCGA TTTCATGGCG ATCCTGACCT CGATCAAGTC GAAGGGTCCG GATGGCGTGT TCTACGGCGG CATGGACCCG CAAGCCGGCC CGATGCTGCG CCAGATGGAG CAACTCGGCC TGTCGAACGT CAAGTTCTTC GGCGGCGACG GCGTGTGCAC CGCCAAGCTC GCCGACCTGT CGGCCGGCGC CAAGACGCTG GGCAACGTGG TCTGCGCCGA AGGCGGCTCC TCGCTCGAGA AGATGCCCGG CGGTACCGCC TGGAAGGCCA AGTACGACGC GAAGTATCCC GGCCAGTTCC AGGTCTACTC GCCCTACGTC TACGACGCGG TATTCGTGCT GGTCGACGCC ATGAAGCGCG CCAACTCGGC CGACCCCAAG GTCTACGGCC CGAAGCTGTT CGAAACCAAC TACACCGGCG TGACCGCGAA GGTGGCCTTC GAGAGCGATG GTGAACTGAA GAACCCGGCG ATGACCCTGT ACGTCTACAA GGACGGCAAG AAGGTCCCGC TGAACTGA
|
Protein sequence | MQLKLKAIAL ATALLATGAV SAQEVIKIGH VAPISGAQAH YGKDNENGAR MAIEELNTQN ITIGGKKVKF ELVAEDDAAD PKQGTAAATK LCDAKVNGVV GHLNSGTTIP ASKIYNDCGI PEITPSATNP KYTQQGFKTA FRILANDNAL GAGLALHAAN NLKLKKVAII DDRTAYGQGV AEVFKKTAQA KGIQIVDEQY TTDKATDFMA ILTSIKSKGP DGVFYGGMDP QAGPMLRQME QLGLSNVKFF GGDGVCTAKL ADLSAGAKTL GNVVCAEGGS SLEKMPGGTA WKAKYDAKYP GQFQVYSPYV YDAVFVLVDA MKRANSADPK VYGPKLFETN YTGVTAKVAF ESDGELKNPA MTLYVYKDGK KVPLN
|
| |