Gene Information |
Locus tag | Mpe_A0308 |
Symbol | |
ID | 4786917 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | + |
Start bp | 334230 |
End bp | 335336 |
Gene Length | 1107 bp |
Protein Length | 368 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 640088860 |
Product | hypothetical protein |
Protein accession | YP_001019505 |
Protein GI | 124265501 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
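The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(334230, 335336)
print(length)                  # 1107
print(protein_length(length))  # 368
```

Both values agree with the Gene Length and Protein Length fields listed in the record.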
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.00122162 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGGCCCCGC AGCGCAGCAG TTCGGCCTGG CCCCTGGCGG CGGCCTGCGC CTGCCTGATG GTCTACGCCA GCCTGCACCC GTTCACCGGC TGGGAATGGC CGACCGCGAC CGACATTCGC TGGTGGCTGC TCCCGGTGCC CAAGCCGCGT GGCGTGGGGC AGTTCGACCT GGTCAGCAAT CTGCTCGCCT ACGTGCCGCT CGGGGCGTTG CTGACAGGTG GACTGCTGCG CGGCGGGGGC CGCGCCTGGC TGGCGTTCGC CGTCGCGGTC GGGGTGTCCA GCGGTCTGAG CTATGTGCTC GAGACCCTGC AGCACCTGTT GCCGAGGCGG GTGCCGTCGA TCATCGACTG GAGCCTGAAC ACCGCCGGTG CGGCGCTCGG CGCGGTGCTG ATGCTGGTGC TGCACGCGCT CGGGCTGCTC GGCCACTGGC AGCGCTGGCG TGAGCGCTGG CTGCTGCGAG ACCGCGGCGC CGGCCTGACG CTGCTGCTGT TGTGGCCGTT CGGCCTGCTG TTCCCGCCGC CGCTGCCCTT CGGACTCGGC CATGTGCTGG ACCGCGCGCG CGACCTGCTC GCCGAAGGAC TGGAGGACAC GGCCTGGGAC GGCTGGCTCG GCGCGACCGC GACCGTCAGC GAACCGCTGG CGCCGGGGCT CGAGATGTTG GGCGTGGCAG CCGGCCTGCT GGCGCCCTGC CTGCTTGCCT ACGCGCTGAC GCGGCCCGGT CCGCGCCGCC TGGTGCTGCC GGCCGGGGCG CTGCTGCTGG GCATCGCGGC CACCACGCTG TCGACGGCGT TGAACTTCGG GCCCGAGCAC GCGCTGACCT GGTGGACACC GCCGGTGTTG CCGGCGATCG GTGTGGTGGC CGTCGTCGCC GTGGGGCTGA CGTGGCTGCC GGCGCGCGCA TCGGCCGCCG TGGCGCTGCT GGTCATCAGT TTCGGCGTGG CGCTGGTCAA CATCGCGCCC GGCGACGCCT ACTACGCGGC CAGCCTGCAA TCGTGGGAGC AGGGCCGTTT CATCCGCTTC CACGGCCTGT CGCAATGGAT CGGCTGGCTG TGGCCCTGGG CGACACTCGC TTACCTGCTC GGTCGTGTCG CAGCCCGGGA CGAATAG
|
Protein sequence | MAPQRSSSAW PLAAACACLM VYASLHPFTG WEWPTATDIR WWLLPVPKPR GVGQFDLVSN LLAYVPLGAL LTGGLLRGGG RAWLAFAVAV GVSSGLSYVL ETLQHLLPRR VPSIIDWSLN TAGAALGAVL MLVLHALGLL GHWQRWRERW LLRDRGAGLT LLLLWPFGLL FPPPLPFGLG HVLDRARDLL AEGLEDTAWD GWLGATATVS EPLAPGLEML GVAAGLLAPC LLAYALTRPG PRRLVLPAGA LLLGIAATTL STALNFGPEH ALTWWTPPVL PAIGVVAVVA VGLTWLPARA SAAVALLVIS FGVALVNIAP GDAYYAASLQ SWEQGRFIRF HGLSQWIGWL WPWATLAYLL GRVAARDE
|
| |
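The sequences above are displayed in space-separated blocks of 10 characters. The following is a minimal sketch of how such a display string could be cleaned and checked, assuming plain uppercase A/C/G/T input; the helper names are hypothetical. Only the first three blocks and the final nine bases of the gene sequence are used here for illustration.

```python
# Stop codons under translation table 11 (bacterial, archaeal and plant plastid code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(display_text: str) -> str:
    """Remove the block spacing used for display and normalise to uppercase."""
    return "".join(display_text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the length is a multiple of 3 and the last codon is a stop."""
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


# First three display blocks of the gene sequence above.
prefix = clean_sequence("ATGGCCCCGC AGCGCAGCAG TTCGGCCTGG")
print(len(prefix))         # 30
print(gc_percent(prefix))  # GC percentage of this 30 bp prefix only

# Last nine bases of the gene sequence above.
print(ends_with_stop(clean_sequence("GA CGAATAG")))  # True
```

Running `gc_percent` and `ends_with_stop` on the full cleaned gene sequence would allow the GC content and Gene Length fields to be compared against the sequence itself.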