Gene Mext_1842 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_1842 
Symbol 
ID5833904 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp2061341 
End bp2062915 
Gene Length1575 bp 
Protein Length524 aa 
Translation table11 
GC content69% 
IMG OID641367641 
Productdihydropteroate synthase DHPS 
Protein accessionYP_001639312 
Protein GI163851269 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0294] Dihydropteroate synthase and related enzymes 
TIGRFAM ID[TIGR00284] dihydropteroate synthase-related protein 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGCGC CCGAACATCT CGTCTTCATC ACCGGCAAGC TGGCCCATGC CCGCCTCGAA 
AAGGTCGCGG CCACGCTGCC GGCCGAGCGG TTCACCTGGA GCATCGCCGA TGCCGGGGTG
AAGGTCGCCG CGCTGATGAC CGAGGAAATC ATCAAGCGGC GGGTGCAGAT GCCCGAGGGC
GCGACCCGGA TCGTCCTGCC CGGCCGCTGC CGCGCCAACC CGGAGGCGCT GGCGCAGCAT
TTCGGCCTCC CGGTGGAGCG GGGGCCGGAC GAGATCGTTG ATTTGCCGGC TTATCTTGGC
CTGACCGGGC GCAAGGTCGA TCTCTCGCGC TACGACCTGC GCATCTTCTC CGAGATCGTC
GACGCCTCGA AGATGACGCC CGACCAGATC CTGGCCAAAG GTCTCGACCT CGCTCGCCGC
GGAGCCGACG TGATCGATCT CGGCGGGCTG CCCGATACGG CGTTCCCGCA TCTGGAGGAC
AGCGTGCGGG CGCTGAAAGG GGCCGGGCTG AAGGTCAGTG TCGATTCCTT CTCCCTCGAT
GAGCTGACCC GCGGGGCGCG GGCCGGCGCC GACTTCCTGC TGAGCCTCAA CGAGGAGACG
CTGGATCTCG CCTTCGAGAC CGACGCGGTG CCGATCCTCG TGCCGATGCG CCCCGACGAC
CTGCCCTCCC TCGACCGCGC CATCGAGCGG ATGGAGCGGG CGGGCCGGCC CTACATGGCT
GATCCGATCC TGGAGCCGAT CCATTTCGGC TTCGTCGACT CGATCGTCCG CTACCGCGAG
ATCCGCGCGC GCTGGCCGAA CATCGAGATG ATGATGGGCA CCGGCAACCT CACCGAACTC
ACCGAGGCCG ACAGCCTCGG CGTCACGGCG CTCCTCGTCG GCATGTGCTC GGAACTCGCC
ATCCGCAACG TGCTGATCGT GCAGGTCTCG AACCACACCC GCCGCACGGT GGAGGAGCAC
GATGCTGCCC GCCGGGTGAT GTACGCGGCA CGAGAGGACG CCGCCCTGCC CAAGGGCTAC
GGCCGCGAGT TGCTGGCGCT GCATGACAAG CGCCCCTTCG TGCAGACCTC CGATGAAATT
GCCGCCCTGG CCGCGGAGGT GCGCGATCCC AATTACCGCA TCGCCGTCGC CGAGGACGGC
ATCCACGTCT ACAACCGCGA CCGTCACACC ACCGGCACCG ACGCGATGGC CTTCTTCCCC
GAACTGAGCG TGGAGAGCGA CGGCGCGCAC GCCTTCTATC TCGGCGGAGA ACTGACGAAG
GCCGAGACCG CGTTCCGCCT CGGCAAGCGC TACGTGCAGG ACGAACCCCT CGATTGGGGC
TGCGCCGCCG ACCGGACCCA GGAAGACACC ACCGCCTTCA AGGCGGCCGG GCCGACGAAA
GCCGCCCACA CCAAGCATAG CGGCCCCGAA GCGCCCACCG CCGAGCGCGC GACCCGGACC
GATCCGGAGC GCGACGCCGC ACCGCCGCGG ACCGAGACCG GAAGCGGCAC GGCATCGGAA
CGCGACCCGC TCAGCGAGCC GAAGGGCGGC CGCATCGTCT GCGGCCGGCT GGTGCCCGAC
GAGGACCGGA ATTAG
 
Protein sequence
MSAPEHLVFI TGKLAHARLE KVAATLPAER FTWSIADAGV KVAALMTEEI IKRRVQMPEG 
ATRIVLPGRC RANPEALAQH FGLPVERGPD EIVDLPAYLG LTGRKVDLSR YDLRIFSEIV
DASKMTPDQI LAKGLDLARR GADVIDLGGL PDTAFPHLED SVRALKGAGL KVSVDSFSLD
ELTRGARAGA DFLLSLNEET LDLAFETDAV PILVPMRPDD LPSLDRAIER MERAGRPYMA
DPILEPIHFG FVDSIVRYRE IRARWPNIEM MMGTGNLTEL TEADSLGVTA LLVGMCSELA
IRNVLIVQVS NHTRRTVEEH DAARRVMYAA REDAALPKGY GRELLALHDK RPFVQTSDEI
AALAAEVRDP NYRIAVAEDG IHVYNRDRHT TGTDAMAFFP ELSVESDGAH AFYLGGELTK
AETAFRLGKR YVQDEPLDWG CAADRTQEDT TAFKAAGPTK AAHTKHSGPE APTAERATRT
DPERDAAPPR TETGSGTASE RDPLSEPKGG RIVCGRLVPD EDRN