Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpe_A0964 |
Symbol | |
ID | 4787110 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | + |
Start bp | 1023918 |
End bp | 1025627 |
Gene Length | 1710 bp |
Protein Length | 569 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640089526 |
Product | hypothetical protein |
Protein accession | YP_001020161 |
Protein GI | 124266157 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00077396 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGAAAGA TATCGACATC GGCCTTTGGG CGGTCGTGTG CGGCGTCTCG TGCGCTGGGC ACGCGCTGCG TGGCGGCGGC GGCGCTCGCC CTGCCGTTTG GCCTGCCCGG GGGCGCGGCA CACGCCTTCG AGATCCGGAC CGACAACCCG GACCTCAAGC TCCGCTGGGA CAACACCTTC AAGTACAGCA ACGCGTTTCG CGTGAAGAGC CAGTCCGAGC GGCTGCTCCA GGACGTCAAC CTCGACGACG GCGACCGCAA CTTCGACAAG GGCCTGATCT CGAACCGCAT CGACCTGCTG TCCGAGGTCG ATGTGGGCTG GAAGAGCCTG GGCTTCCGTG CCAGCGGTGC GGCCTGGTAC GACACGGTCT ACAACCGCCG CAACGACCAC GACTCACCGG CCTCGGCCAA CTCCACCAGC GTCGGCCCCG ACCGCTTCAC CGACGCCACG CGCAAGCTGC ACGGGCGCAA GGCCGAGGTG CTCGACTTCT TCGCGTTCGG CAAGACCGAT CTCGGCTCGA TGCCGCTGAC GGTGCGGCTC GGTCGCCACA CGCTGATCTA CGGCGAGAGC CTGTTCTTCG GCTCGAACGG CATCGCGGCC GCGCAGGGGC CGGTCGACCT GGTGAAGCTG CTCACCGTGC CGAGTTCGCA GTTCAAGGAG ATCCTGCGGC CGGTGGAGCA GATCTCCAGC GTGCTCCAGA TCAATTCGCA GATGACGCTG GGCGCGTACT ACCAGTTCAA GTTCCGCGAG AGCATCATCC CGGCCGCCGG CAGCTACCTC AGCGCCTTCG ACTTCGTCGG CGACGGCGCC GAGCGCTTCA TCGTCGGCGC CCCCATCACC CCTGGCGGCG GCGCGGCGGC ATTCTGGCGC GGCGCCGACA TCGAGGCCAG GAACTCGGGG CAGGGCGGCC TGCAGTTCCG CTGGGCCCCG ACCGGGAGCG AATGGGAGTT CGGCGTCTAC GCGGCGCGCT ACCACGACAA GGGTGCCGCG CTGTATCTCA CGCCGTCGGC CGCGCCCGAC GTGGTGAGCG GCCGGGTCGG CGCGATCCAG CAGGTCTACC ACGAGGGCAT CAAGACCTAC GGCGCCAGCG CGACCACCTC GATCGGCCAG CTCAACCTGG CCTTCGAGGG CTCGATCCGC CGCAACGCCT CGCTGGTGAG CGACCCGCAG GTGGTGCTGC CGGGCGTGCT CGCCGACAAC GACGCCCATC CGCTCTACGC CGTCGGCAAC ACGGCGCATG CGCAGGTGTC GGGCATCTAC GTGCTGTCGG AGACGCGGCT GTGGGATGCC GGCGCCTTCC TCGGCGAGGT GGCGTGGAAC CGCCGGCTGA GCATCGACAA GAACCCGGGG GCGCTCGATC CCAACACCAC GCGCGATGCC GCGGCGCTGC GCTTCATCTT CGAGCCTTCG TACTTCCAGG TGGTCGACGG CGTCGACCTC TCGCTGCCGA TCGGCGTGGG CTACAACTTC TACGGCCGCT CGTCGTCGAT CTTCAACTTC AACGGCGGCA GCTCCAAGGG CGGTGACTTC TCGATCGGCG TGAAGGCGAC CTACCGCACG GTCTGGCAGG CCGGACTCAC CTACACCGGC TTCTATGGCG GCGAGGACAC CTTCCTCACG CCGGCCAACT CGCCCACGCC GGTGCTCTCC TACAAGCAGT TCTACAAGGA CCGCAACTTC ATTTCCTTCT CGCTGCAGCG TGCCCTTTGA
|
Protein sequence | MRKISTSAFG RSCAASRALG TRCVAAAALA LPFGLPGGAA HAFEIRTDNP DLKLRWDNTF KYSNAFRVKS QSERLLQDVN LDDGDRNFDK GLISNRIDLL SEVDVGWKSL GFRASGAAWY DTVYNRRNDH DSPASANSTS VGPDRFTDAT RKLHGRKAEV LDFFAFGKTD LGSMPLTVRL GRHTLIYGES LFFGSNGIAA AQGPVDLVKL LTVPSSQFKE ILRPVEQISS VLQINSQMTL GAYYQFKFRE SIIPAAGSYL SAFDFVGDGA ERFIVGAPIT PGGGAAAFWR GADIEARNSG QGGLQFRWAP TGSEWEFGVY AARYHDKGAA LYLTPSAAPD VVSGRVGAIQ QVYHEGIKTY GASATTSIGQ LNLAFEGSIR RNASLVSDPQ VVLPGVLADN DAHPLYAVGN TAHAQVSGIY VLSETRLWDA GAFLGEVAWN RRLSIDKNPG ALDPNTTRDA AALRFIFEPS YFQVVDGVDL SLPIGVGYNF YGRSSSIFNF NGGSSKGGDF SIGVKATYRT VWQAGLTYTG FYGGEDTFLT PANSPTPVLS YKQFYKDRNF ISFSLQRAL
|
| |