Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpe_A3791 |
Symbol | |
ID | 4785960 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | - |
Start bp | 4008908 |
End bp | 4010704 |
Gene Length | 1797 bp |
Protein Length | 598 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 640092374 |
Product | hypothetical protein |
Protein accession | YP_001022979 |
Protein GI | 124268975 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3387] Glucoamylase and related glycosyl hydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.000380463 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | GTGAGCGCAG CGGCTTCGCT CGAGCTGGGT CTGGTCGGCA ACTGCGCGAT CAGCGCGCTG ATCGACCGCG AGGCCCGCAT CGTCTGGTGC TGCCTGCCGC GCTTCGACGG CGACCCGACC TTCCACGCGC TGATCGACAC GCCCGACGCG TGGCCGGCCG ACGGCAGCTG GTGCATCGAG ATCGAGAACT TCGTGCGCAG CGAGCAGGCC TACGACGAAG GCACCGCCAT CCTGCGCACG CGGCTGTTCA GTGCCGACGG CGATGTGATC GAGATCACCG ATTTCGCACC GCGCTTCCTC AGCCGCGACC GCACCTTCCG GCCCGCGATG CTGGTGCGGC GTGTGCATCC GCTGCAGGGC CATCCGCGCG TGCGCGTCAG CCTGCGGCCT CGCGCCGACT GGGGGCGCGT CGCGCCGGAG GTCGCGCGCG GCAGCCATCA CTTGCGATAC GCCGGCCCGA CCGGGACGCT GCGTCTCACC ACCGATGCGC CCATCACCTA TGTGCAGGAC GAGACCTGGT TCTCGCTGGC GGCGCCGCTG AACCTGCTGC TCGGCCCCGA CGAGACGCTG GAGCGCGGCG CGGCCGAGGT GGCGCGCGAT TTCGAGGAAC TCACCGCGCT CTACTGGCGC ACCTGGACCC GCCGGCTGGC CCTCCCGCTG GAATGGCAGG CGGCCGTTAT CCGCGCCGCG ATCACGCTCA AGCTGTGCCT GTTCGACGAG ACCGGCGCGA TCGTCGCGGC GATGACCACC AGCCTGCCCG AGGCGCCGGG CAGCGGCCGC AACTGGGATT ACCGGTACTG CTGGCTGCGC GACGCCTTCT TCGTGGTGCG GGCGCTCAAC AGCCTGTCCG AACTGGAGAC GATGGAGGAC TACCTGCGTT GGCTGCACAA CGTGGTGCGC GACGCGCGCG GGGGCTTCAT CCAGCCGCTG TACGGCCTGG GCCTTGAGAA GGATCTGCCC GAGGAAGAAC TCCCGCATCT GCGCGGCTAC CGCGGCATGG GTCCGGTGCG GCGCGGCAAC CAGGCGCACA CGCACGCGCA GCACGACGTG TACGGCAACG TGCTGCTCGG GGCGTCGCAG TCCTTCCATG ATCTGCGGCT GTTCCGCCGC GCTGACGCCG ACGACTTCGC GCGGCTCGAG GCGGTGGGTG AGCAGGCCTG GCTGGTGCAC GACCAGCCCG ATGCGGGAAT GTGGGAGCTG CGCACGCGCG CGCGGGTGCA CACCTCGTCG GCGCTGATGT GCTGGGCGGC CTGCGATCGG CTGGCGAAGA TCGCCACCCG TCTGGGCCTG CCCGAGCGGG ACGCCCACTG GCGCGGCCGG GCCGCGGCGA TCCGAGAGCG CGTGCTGCGC GAAGCCTGGA GCGACAAGCG CCAGGCCTTC GCCGAGAGCC TGGGTGGCGA GAATCTCGAT GCCAGCGTGC TGCTGATGGC CGAAGTGGGC TTCATCGACC CGATGGACCC GCGCTTCGTC GCGACGGTCG ACGCGCTGGA GGCGCACCTC TGCGATGGCC CCTACATGCG TCGCTACGAG GCGGCCGACG ACTTCGGCAA ACCTGAGACG GCCTTCAACA TCTGCGCCTT CTGGCGCATC GACGCGTTGG TGCGCATCGG CCGGCGCGAG CAGGCCCGGC AGATCTTCGA ATCCATGCTC GCGTCGCGCA ATGCGCTGGG CCTGCTCTCG GAAGACACCG ACGCGCGCAC CGGCGAGCTG TGGGGCAACT TCCCGCAGAC CTATTCGATG GTCGGCATCA TCAATGCCGC AATGCGTCTG TCGGTGCCCT GGGACACGGT GATCTGA
|
Protein sequence | MSAAASLELG LVGNCAISAL IDREARIVWC CLPRFDGDPT FHALIDTPDA WPADGSWCIE IENFVRSEQA YDEGTAILRT RLFSADGDVI EITDFAPRFL SRDRTFRPAM LVRRVHPLQG HPRVRVSLRP RADWGRVAPE VARGSHHLRY AGPTGTLRLT TDAPITYVQD ETWFSLAAPL NLLLGPDETL ERGAAEVARD FEELTALYWR TWTRRLALPL EWQAAVIRAA ITLKLCLFDE TGAIVAAMTT SLPEAPGSGR NWDYRYCWLR DAFFVVRALN SLSELETMED YLRWLHNVVR DARGGFIQPL YGLGLEKDLP EEELPHLRGY RGMGPVRRGN QAHTHAQHDV YGNVLLGASQ SFHDLRLFRR ADADDFARLE AVGEQAWLVH DQPDAGMWEL RTRARVHTSS ALMCWAACDR LAKIATRLGL PERDAHWRGR AAAIRERVLR EAWSDKRQAF AESLGGENLD ASVLLMAEVG FIDPMDPRFV ATVDALEAHL CDGPYMRRYE AADDFGKPET AFNICAFWRI DALVRIGRRE QARQIFESML ASRNALGLLS EDTDARTGEL WGNFPQTYSM VGIINAAMRL SVPWDTVI
|
| |