Gene Information |
Locus tag | Mext_4820 |
Symbol | |
ID | 5833886 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | + |
Start bp | 5385172 |
End bp | 5386059 |
Gene Length | 888 bp |
Protein Length | 295 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641370617 |
Product | bacteriochlorophyll/chlorophyll a synthase |
Protein accession | YP_001642259 |
Protein GI | 163854216 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0382] 4-hydroxybenzoate polyprenyltransferase and related prenyltransferases |
TIGRFAM ID | [TIGR01476] bacteriochlorophyll/chlorophyll synthetase |
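The coordinate and length fields above are related by simple arithmetic. The following minimal Python sketch assumes that Start bp and End bp are 1-based and inclusive (an assumption, though it is consistent with the listed 888 bp) and that the final codon is a stop codon not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, excluding the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(5385172, 5386059)
print(length_bp)                  # 888
print(protein_length(length_bp))  # 295
```

Both values agree with the Gene Length and Protein Length fields of this record.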
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 0.0428449 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAACGC CCGCGCCGAG CGCCATCGTC GAGCTTCTCA AGCCCATCAC GTGGTTCGCG CCGATGTGGG CGTTTGGCTG CGGGGTCGTC TCCTCCGGTC AGACGCCATC CGGACAGTGG CTGGTCATCG CCGCGGGCGT GCTGCTCGCC GGACCGCTGG TCTGCGCCAC GAGCCAGGCC GCCAACGACT GGTTCGATCG CCACGTCGAT GCCATCAACG AGCCTGACCG GCCGATCCCC TCGGGACGCA TTCCCGGCCG GTGGGGCCTT TATCTCGCGG CCGGCTGGAC GGTCCTGTCG CTGGCTGTGG CCGCGATGCT CGGACCCTGG ATCCTCGGTG CCGCCCTGTT CGGCCTCGTG CTTGCCTGGA TCTACTCGGC ACCTCCATTC CGGCTGAAGC AGAATGGCTG GTGGGGCAAT TCGGCGGTGG CGCTCTGCTA CGAGGGGCTG CCCTGGTTCA CCGGCGCCGC GGTGATGGCT GCTTCGATGC CCGACCGGCG GGTGCTTCTC GTCGCCCTGC TCTACTCGAT CGGCGCGCAC GGCATCATGA CCCTGAACGA CTTCAAGTCG GTGGAGGGGG ACCGGGCCAT GGGCCTGCGC TCGCTGCCGG TGCAGCTCGG CTCCGATCGC GCGGCGCGCT TCGCCTGCCT CGTCATGGCC CTGCCGCAGA TGGTGGTGGT CGCGCTCCTC CTCCATTGGG AGCGTCCCTG GCATGCCGCG CTGATCGGCG CCCTGCTCGT GGGGCAGCTC GTCCTGATGA CGCATTTTCT GAAGGCGCCC CGCGCCCGCG CCGCTTGGTA CAACGGCACC GGCACGACGC TCTACGTCTT CGGCATGTTG GCCTCGGCTT TTGCCCTGCG CCCGCTCGTG CAAGGGCTGG CGCCATGA
|
Protein sequence | MATPAPSAIV ELLKPITWFA PMWAFGCGVV SSGQTPSGQW LVIAAGVLLA GPLVCATSQA ANDWFDRHVD AINEPDRPIP SGRIPGRWGL YLAAGWTVLS LAVAAMLGPW ILGAALFGLV LAWIYSAPPF RLKQNGWWGN SAVALCYEGL PWFTGAAVMA ASMPDRRVLL VALLYSIGAH GIMTLNDFKS VEGDRAMGLR SLPVQLGSDR AARFACLVMA LPQMVVVALL LHWERPWHAA LIGALLVGQL VLMTHFLKAP RARAAWYNGT GTTLYVFGML ASAFALRPLV QGLAP
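The sequences above are printed in space-separated blocks of ten characters. As an illustration of how the gene sequence relates to the GC content field and to the protein sequence, here is a minimal Python sketch. It assumes the blocks are joined by stripping whitespace, and it uses the standard codon assignments, which translation table 11 shares with table 1 for elongation (the two tables differ in the start codons they permit). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical, and only the first three blocks of the gene are used as input.

```python
from itertools import product

# Standard codon assignments in TCAG order; translation table 11 uses the
# same amino acid assignments and differs only in its allowed start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(blocked: str) -> str:
    """Join space-separated sequence blocks into one uppercase string."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons from position 0, stopping at a stop codon."""
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence from this record.
prefix = clean("ATGGCAACGC CCGCGCCGAG CGCCATCGTC")
print(translate(prefix))  # MATPAPSAIV
print(round(gc_fraction(prefix), 2))
```

The translated prefix matches the first block of the protein sequence. Passing the full gene sequence to the same functions would give the whole-gene GC fraction and the full translation.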