Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mext_1941 |
Symbol | |
ID | 5831825 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | + |
Start bp | 2161313 |
End bp | 2162212 |
Gene Length | 900 bp |
Protein Length | 299 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641367741 |
Product | squalene synthase HpnC |
Protein accession | YP_001639411 |
Protein GI | 163851368 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG1562] Phytoene/squalene synthetase |
TIGRFAM ID | [TIGR03464] squalene synthase HpnC |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 0.692084 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGCCG CGCTTCAGAC CGCCAGCGAC GCCCGCACCG GCAAGGGCCA GCACGACGAG AACTTTCCCG TCGCCTCGCA CCTGATCCAT CCGCGCTACC GCGGCGCGAT CCTGGCCTTC TACAACTACG TGCGGGCCGG CGACGACGTT GCCGACCACA TCGGCCTGTC GCCCGAGCGC AAGATCGAGA TGCTCGACGC GCTCGCCGAC GCGCTGACCG GCAAGGGCGG CTCCGACCCC ACGGTCGAGC CGCTCAAGCG GGAACTCGCG GCCCATCGCC AGCCGCCGAC CCACGCGCTC GAACTGCTCG ACGCCTTCCG CATGGATGCG CGCAAGTCGC GCTACGCGGA TTGGGACGAG CTGATCCACT ATTGCCGCTA CTCGGCCATG CCGGTCGGGC GCTTCCTGCT CGACGTGCAC GGCGAGGATC CGGCGCGGGT CTACCGGACC TCCGACGCGA TTTGCGCCGC GCTGCAGGTG CTCAACCACC TTCAGGATTG CGGCAAGGAT TTTCGCGATC TCGACCGGGT CTACATCCCC CTCGACGTCA TGCGGAAGCA TGGCGCCGAC GTGTCGATGC TGGGCGCCGA CCGGGCGAGC CCGCAACTGC GCGCGGTGAT CCGCGAACTC GCCGAGCGCA CCCTGGTGCT GCTGGACGAG GGGGCGCCGC TGCCGAACCA GATCGACGAC CTGCGCTTGA GCCTGGAGAT CGCCGCGATC CATCGCCTCG CCGTCGTCCT GACGAAGGGG TTGCTGACCC GCGATCCGCT CAGCGAGAAA GTCCATCACG GTAAGGCCGC CTTCGCGCTC ACCGCGCTCG GCGGCATCGC CGCCACGCTG GTGCGCCGCC CGTTCCGGTC GCGTGCGCCC CGTCCCGCCC CCGCCGGAGC CGGCCGATGA
|
Protein sequence | MSAALQTASD ARTGKGQHDE NFPVASHLIH PRYRGAILAF YNYVRAGDDV ADHIGLSPER KIEMLDALAD ALTGKGGSDP TVEPLKRELA AHRQPPTHAL ELLDAFRMDA RKSRYADWDE LIHYCRYSAM PVGRFLLDVH GEDPARVYRT SDAICAALQV LNHLQDCGKD FRDLDRVYIP LDVMRKHGAD VSMLGADRAS PQLRAVIREL AERTLVLLDE GAPLPNQIDD LRLSLEIAAI HRLAVVLTKG LLTRDPLSEK VHHGKAAFAL TALGGIAATL VRRPFRSRAP RPAPAGAGR
|
| |