Gene Information |
Locus tag | Mext_1842 |
Symbol | |
ID | 5833904 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | - |
Start bp | 2061341 |
End bp | 2062915 |
Gene Length | 1575 bp |
Protein Length | 524 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641367641 |
Product | dihydropteroate synthase DHPS |
Protein accession | YP_001639312 |
Protein GI | 163851269 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0294] Dihydropteroate synthase and related enzymes |
TIGRFAM ID | [TIGR00284] dihydropteroate synthase-related protein |
|
|
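The length fields in the record above can be cross-checked against the coordinates. The sketch below assumes 1-based, inclusive coordinates and an unspliced CDS whose final codon is an untranslated stop codon; `cds_lengths` is a hypothetical helper name, not part of any database API.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon
    that is counted in the gene length but not in the protein length.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_length = gene_length // 3 - 1  # drop the stop codon
    return gene_length, protein_length


# Start bp and End bp copied from the record above.
gene_len, prot_len = cds_lengths(2061341, 2062915)
print(gene_len, prot_len)  # 1575 524
```

Both values agree with the Gene Length and Protein Length fields of the record.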
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 39 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGCGC CCGAACATCT CGTCTTCATC ACCGGCAAGC TGGCCCATGC CCGCCTCGAA AAGGTCGCGG CCACGCTGCC GGCCGAGCGG TTCACCTGGA GCATCGCCGA TGCCGGGGTG AAGGTCGCCG CGCTGATGAC CGAGGAAATC ATCAAGCGGC GGGTGCAGAT GCCCGAGGGC GCGACCCGGA TCGTCCTGCC CGGCCGCTGC CGCGCCAACC CGGAGGCGCT GGCGCAGCAT TTCGGCCTCC CGGTGGAGCG GGGGCCGGAC GAGATCGTTG ATTTGCCGGC TTATCTTGGC CTGACCGGGC GCAAGGTCGA TCTCTCGCGC TACGACCTGC GCATCTTCTC CGAGATCGTC GACGCCTCGA AGATGACGCC CGACCAGATC CTGGCCAAAG GTCTCGACCT CGCTCGCCGC GGAGCCGACG TGATCGATCT CGGCGGGCTG CCCGATACGG CGTTCCCGCA TCTGGAGGAC AGCGTGCGGG CGCTGAAAGG GGCCGGGCTG AAGGTCAGTG TCGATTCCTT CTCCCTCGAT GAGCTGACCC GCGGGGCGCG GGCCGGCGCC GACTTCCTGC TGAGCCTCAA CGAGGAGACG CTGGATCTCG CCTTCGAGAC CGACGCGGTG CCGATCCTCG TGCCGATGCG CCCCGACGAC CTGCCCTCCC TCGACCGCGC CATCGAGCGG ATGGAGCGGG CGGGCCGGCC CTACATGGCT GATCCGATCC TGGAGCCGAT CCATTTCGGC TTCGTCGACT CGATCGTCCG CTACCGCGAG ATCCGCGCGC GCTGGCCGAA CATCGAGATG ATGATGGGCA CCGGCAACCT CACCGAACTC ACCGAGGCCG ACAGCCTCGG CGTCACGGCG CTCCTCGTCG GCATGTGCTC GGAACTCGCC ATCCGCAACG TGCTGATCGT GCAGGTCTCG AACCACACCC GCCGCACGGT GGAGGAGCAC GATGCTGCCC GCCGGGTGAT GTACGCGGCA CGAGAGGACG CCGCCCTGCC CAAGGGCTAC GGCCGCGAGT TGCTGGCGCT GCATGACAAG CGCCCCTTCG TGCAGACCTC CGATGAAATT GCCGCCCTGG CCGCGGAGGT GCGCGATCCC AATTACCGCA TCGCCGTCGC CGAGGACGGC ATCCACGTCT ACAACCGCGA CCGTCACACC ACCGGCACCG ACGCGATGGC CTTCTTCCCC GAACTGAGCG TGGAGAGCGA CGGCGCGCAC GCCTTCTATC TCGGCGGAGA ACTGACGAAG GCCGAGACCG CGTTCCGCCT CGGCAAGCGC TACGTGCAGG ACGAACCCCT CGATTGGGGC TGCGCCGCCG ACCGGACCCA GGAAGACACC ACCGCCTTCA AGGCGGCCGG GCCGACGAAA GCCGCCCACA CCAAGCATAG CGGCCCCGAA GCGCCCACCG CCGAGCGCGC GACCCGGACC GATCCGGAGC GCGACGCCGC ACCGCCGCGG ACCGAGACCG GAAGCGGCAC GGCATCGGAA CGCGACCCGC TCAGCGAGCC GAAGGGCGGC CGCATCGTCT GCGGCCGGCT GGTGCCCGAC GAGGACCGGA ATTAG
|
Protein sequence | MSAPEHLVFI TGKLAHARLE KVAATLPAER FTWSIADAGV KVAALMTEEI IKRRVQMPEG ATRIVLPGRC RANPEALAQH FGLPVERGPD EIVDLPAYLG LTGRKVDLSR YDLRIFSEIV DASKMTPDQI LAKGLDLARR GADVIDLGGL PDTAFPHLED SVRALKGAGL KVSVDSFSLD ELTRGARAGA DFLLSLNEET LDLAFETDAV PILVPMRPDD LPSLDRAIER MERAGRPYMA DPILEPIHFG FVDSIVRYRE IRARWPNIEM MMGTGNLTEL TEADSLGVTA LLVGMCSELA IRNVLIVQVS NHTRRTVEEH DAARRVMYAA REDAALPKGY GRELLALHDK RPFVQTSDEI AALAAEVRDP NYRIAVAEDG IHVYNRDRHT TGTDAMAFFP ELSVESDGAH AFYLGGELTK AETAFRLGKR YVQDEPLDWG CAADRTQEDT TAFKAAGPTK AAHTKHSGPE APTAERATRT DPERDAAPPR TETGSGTASE RDPLSEPKGG RIVCGRLVPD EDRN
|
| |
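The sequences above are displayed in space-separated blocks of 10 characters. A minimal sketch of how such a field might be cleaned up and sanity-checked follows; the helper names (`join_blocks`, `gc_fraction`, `ends_with_stop`) are hypothetical and chosen only for illustration. Only a short excerpt of the gene sequence is used here, so the printed GC value describes the excerpt, not the whole gene.

```python
# Stop codons in NCBI translation table 11 (bacterial, archaeal, plant plastid).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def join_blocks(text):
    """Remove the whitespace between display blocks and upper-case the result."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Return the fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def ends_with_stop(seq):
    """True if seq is a whole number of codons and its last codon is a stop."""
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


# Excerpt only: the first three blocks of the gene sequence above.
excerpt = join_blocks("ATGAGCGCGC CCGAACATCT CGTCTTCATC")
print(len(excerpt), gc_fraction(excerpt))
```

Applied to the full Gene sequence field, `gc_fraction` could be compared with the GC content field and `ends_with_stop` with the final `TAG` codon of the record.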