Gene Information |
Locus tag | Mpe_A3801 |
Symbol | |
ID | 4785970 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | - |
Start bp | 4019223 |
End bp | 4020731 |
Gene Length | 1509 bp |
Protein Length | 502 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 640092384 |
Product | cysteine proteinase |
Protein accession | YP_001022989 |
Protein GI | 124268985 |
COG category | |
COG ID | |
TIGRFAM ID | |
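
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a stop codon that is not counted in the protein length. Both assumptions are consistent with the values in this record but are not stated by it. The function names are hypothetical helpers introduced for illustration.

```python
# Values copied from the Gene Information table above.
START_BP = 4019223
END_BP = 4020731
GENE_LENGTH_BP = 1509
PROTEIN_LENGTH_AA = 502


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a coordinate span, assuming 1-based inclusive positions."""
    return end_bp - start_bp + 1


def coding_length_aa(gene_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming the final codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)          # True
print(coding_length_aa(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)    # True
```

Under these assumptions the reported 1509 bp and 502 aa agree with the start and end coordinates.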
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0184346 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGCGCC ACCGGGCTCG CGCAGAATCG CGGCGCATGA GAGCAAGCCG TTTCCTCCGT CGCGCGGCCA CGCTGTCGGC TCTGTGGCTG GCCGGCGCGG CCGCTGCGCA GGACGCGGCG CCGACGGTCT GTCGCGTCAA AGGCCTGAAG CACGAGGTGT TGTGCGGCCA TGTGACGCGG GCCCTGGATC CGGCGCAGCC GGGGGGCACG ACGGTCACGG TGCACTACGT GGTCGTGCCG GCGGCGGCGC GTCACAAGCG AGCCGACCCG ATCTTCTTCT TTGCCGGCGG CCCTGGACAG AGCGCCATCG CGCTGGCCGG TTCGGTGCTG CCGCTGTTCC AGCGCCTGAA CAACCGGCGT GACCTGGTCT TCATCGATCA GCGCGGCACC GGGCGCTCGG CGCCCCTGGC CTGCGACGCC GAGGACGAGC TGCCGCTGGC GCAGCGCTTC GATGCCGAGC GTGGTCGCCA GCGTCTCGCC GCCTGCCTGG CTTCGCTGCG AAAGCTGCCG CATGGCGATC TGCGCCAGTA CACGACGAGC ATCGCGATGG CCGATGCCGA TGCGGTGCGG GCCGCACTGG GTGCGTCGCA GATCAACCTG GTGGGCGGCT CCTACGGCAC GCGGGCGGCG CTGGACTATC TGCGGCAGTT CCCGTCGCAT GTGCGCCGTA TCGTGCTCGA CGGCGTCGCG CCGCCCGACA TGGTGCTGCC CGCCAGCATG GGCCAGGATG TGGAAGCCGC GCTGGCGCGG CTTTTCACCG ACTGCGAGCA GGAGCCAAGC TGCCAGGCGC GCCACCCGCG GCTGCGGGCG CACTGGCAGG GCCTGCTGAG CGCCGCGCCT CGGCCAGCGA GCGTGGTCGA TCCGCTGGAT GGCCGGCCGG CCACGGTGAG GATCGATGTC GATCTGCTGG CCAACGCGGT GCGCGGGCCG CTGTACGCGC CGGGCCTGGC CGCGGCCCTG CCCTTCGCGA TCGACGAGGC GGCCGCCGGG CGCTACGCGG CCCTGGTCGG ATTGGCCGGG GTGCTGGGCG GCGGGCCGCG GACGACGCGG CTGTTCGAGG GCCTGCATTT CTCGGTGGTG TGCGCAGAGG ATGCGCCGGA CGCCGCGGCT CCGCCGCCGT CCGGGCTGGG TGCCGTGTAC CTGCGTCCCT ATGCGGCGCT GTGCCGCGAC TGGCCGCGCG GCAGTGTGCC GCCGACTTTC CGCGACCTGC CGACCAGCCA GGTTCCGGTG CTGGCCCTCA GCGGCACGCT CGACCCGGTG ACGCCGCCGC GCCACGGCGA GCGGGTGGTG AAGGCGCTCG GGCCGCGGGC ACGTCATGTG GTGGTACCGA ATGCCGGTCA CGGCGTGATG GCGATCGGCT GCACGCGCGA GCTGCTGTAC CGCTTCATCG ACGCGGACGA CGAGGCCCAG GCCCTGGCGG TCGACGCCGG GTGCCTGGCG CACCTGCCGC GCCCGCCGGC GTTCGAGCCG CCGCGGCCCG GACCGTCGCT GGCGGGAGCG GCGCGATGA
|
Protein sequence | MPRHRARAES RRMRASRFLR RAATLSALWL AGAAAAQDAA PTVCRVKGLK HEVLCGHVTR ALDPAQPGGT TVTVHYVVVP AAARHKRADP IFFFAGGPGQ SAIALAGSVL PLFQRLNNRR DLVFIDQRGT GRSAPLACDA EDELPLAQRF DAERGRQRLA ACLASLRKLP HGDLRQYTTS IAMADADAVR AALGASQINL VGGSYGTRAA LDYLRQFPSH VRRIVLDGVA PPDMVLPASM GQDVEAALAR LFTDCEQEPS CQARHPRLRA HWQGLLSAAP RPASVVDPLD GRPATVRIDV DLLANAVRGP LYAPGLAAAL PFAIDEAAAG RYAALVGLAG VLGGGPRTTR LFEGLHFSVV CAEDAPDAAA PPPSGLGAVY LRPYAALCRD WPRGSVPPTF RDLPTSQVPV LALSGTLDPV TPPRHGERVV KALGPRARHV VVPNAGHGVM AIGCTRELLY RFIDADDEAQ ALAVDAGCLA HLPRPPAFEP PRPGPSLAGA AR
|
| |
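
The sequences above are displayed in space-separated blocks of ten characters. A minimal sketch of how such a displayed sequence might be normalized before computing a GC fraction is given below. The helper names are hypothetical, and only the first and last blocks of the gene sequence are used as a short demonstration. Pasting the full gene sequence into `normalize_sequence` and passing the result to `gc_fraction` would give a value that can be compared with the reported GC content of 74%.

```python
def normalize_sequence(displayed: str) -> str:
    """Remove display whitespace (spaces, line breaks) and upper-case the result."""
    return "".join(displayed.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C; returns 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First and last blocks of the gene sequence shown above (demo only,
# not the full 1509 bp sequence).
first_block = "ATGCCGCGCC"
last_block = "GCGCGATGA"

demo = normalize_sequence(first_block + " " + last_block)
print(demo[:3], demo[-3:])  # ATG TGA (start codon and stop codon)
print(len(demo))            # 19
```

The demo shows that the gene sequence is listed in the coding orientation, starting with ATG and ending with the stop codon TGA, even though the gene lies on the minus strand of the replicon.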