Gene Mchl_2177 details

Gene Information

Locus tag: Mchl_2177
Symbol:
ID: 7116123
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium chloromethanicum CM4
Kingdom: Bacteria
Replicon accession: NC_011757
Strand:
Start bp: 2275052
End bp: 2276626
Gene Length: 1575 bp
Protein Length: 524 aa
Translation table: 11
GC content: 69%
IMG OID: 643524927
Product: dihydropteroate synthase DHPS
Protein accession: YP_002420952
Protein GI: 218530136
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0294] Dihydropteroate synthase and related enzymes
TIGRFAM ID: [TIGR00284] dihydropteroate synthase-related protein
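
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption that matches the listed gene length) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record; the strand field is empty in the record and is not needed for this check.

```python
# Values copied from the gene record above.
start_bp = 2275052
end_bp = 2276626

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# A coding sequence is read in codons of three nucleotides.
codon_count, remainder = divmod(gene_length, 3)

# Assumption: the last codon is a stop codon and encodes no amino acid.
protein_length = codon_count - 1

print(gene_length)     # 1575
print(remainder)       # 0
print(protein_length)  # 524
```

Both results agree with the "Gene Length" and "Protein Length" fields of the record.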


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.339932
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value: 0.330618
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGCGCGC CCGAACATCT CGTCTTCATC ACCGGCAAGC TGGCCCATGC CCGCCTCGAA 
AAGGTCGCGG CCACGCTGCC GGCCGAGCGC TTCACCTGGA GCATCGCCGA TGCCGGGGTG
AAGGTCGCCG CGCTGATGAC GGAGGAAATC ATCAAGCGCC GGGTGCAGAT GCCCGAGGGC
GCGACCCGGA TCGTCCTGCC CGGCCGCTGC CGCGCCAACC CGGAGGCGCT GGCCCAGCAT
TTCGGCCTCC CGGTGGAGCG GGGGCCGGAC GAGATCGTTG ATTTGCCGGC TTATCTTGGT
CTGACCGGGC GTAAGGTCGA TCTCTCGCGC CACGATCTGC GCATCTTCTC CGAGATCGTC
GACGCCTCGA AGATGACGCC CGACCAGATC CTGGCCAAGG GTCTCGACCT CGCCCGCCGC
GGGGCCGACG TGATCGACCT CGGCGGGCTG CCCGACACGG CGTTCCCGCA TCTGGAGGAC
AGCGTCCGGG CGCTGAAAGG CGCTGGCCTC AAGGTCAGCG TCGATTCCTT CTCCCTCGAC
GAGCTGACCC GTGGGGCGCG GGCCGGCGCC GACTTCCTGC TGAGCCTCAA CGAGGAGACG
CTGGATCTCG CCTTCGAGAC CGACGCGGTG CCGATCCTCG TGCCGATGCG GCCCGACGAC
CTGCCCTCCC TCGACCGCGC CATCGAACGG ATGGAGCGGG CGGGCCGGCC CTACATGGCC
GATCCGATCC TGGAGCCGAT CCATTTCGGC TTCGTCGACT CGATCGTCCG CTACCGCGAG
ATCCGCGCGC GCTGGCCGAA CATCGAGATG ATGATGGGCA CCGGCAACCT CACCGAACTG
ACCGAGGCCG ACAGCCTCGG TGTCACGGCG CTCCTCGTCG GCATGTGCTC GGAACTCGCC
ATCCGCAACG TGCTGATCGT GCAGGTCTCG AACCACACCC GCCGCACGGT GGAGGAGCAC
GACGCCGCCC GCCGGGTGAT GTACGCGGCC AAAGAGGACG CCGCCCTGCC CAAGGGCTAC
GGCCGCGAGT TGCTGGCGCT GCACGACAAG CGCCCCTTCG TGCAGACCTC CGATGAAATT
GCCGCTCTGG CCGCCGAGGT GCGTGATCCC AATTACCGCA TCGCCGTCGC CGAGGACGGC
ATCCACGTCT ACAACCGCGA CCGTCACACC ACCGGCACCG ACGCGATGGC CTTCTTCCCC
GAACTGAGCG TGGAGAGCGA CGGCGCGCAC GCCTTCTATC TCGGCGGAGA ACTGACGAAG
GCCGAGACCG CGTTCCGCCT CGGCAAGCGC TACGTGCAGG ACGAACCCCT CGATTGGGGC
TGCGCCGCCG ACCGCACCCA GGAAGACACC ACCGCCTTCA AGGCGGCCGG GCCGACGAAA
TCGGCGCATA CCAAGCATAG CGGCCCCGAG GCGCCCGCGG CCGAGCGCGC GACCCGGACC
GATCCGGAGC GCGACGCCGC ACCGCCGCGG ACCGAGACCG GAAGCAGCAC GGCATCGGAA
CGCGACCCGC TCAGCGAGCC GAAGGGCGGC CGCATCGTCT GCGGCCGGCT GGTGCCCGAC
GAGGACCGGA ATTAG
 
Protein sequence
MSAPEHLVFI TGKLAHARLE KVAATLPAER FTWSIADAGV KVAALMTEEI IKRRVQMPEG 
ATRIVLPGRC RANPEALAQH FGLPVERGPD EIVDLPAYLG LTGRKVDLSR HDLRIFSEIV
DASKMTPDQI LAKGLDLARR GADVIDLGGL PDTAFPHLED SVRALKGAGL KVSVDSFSLD
ELTRGARAGA DFLLSLNEET LDLAFETDAV PILVPMRPDD LPSLDRAIER MERAGRPYMA
DPILEPIHFG FVDSIVRYRE IRARWPNIEM MMGTGNLTEL TEADSLGVTA LLVGMCSELA
IRNVLIVQVS NHTRRTVEEH DAARRVMYAA KEDAALPKGY GRELLALHDK RPFVQTSDEI
AALAAEVRDP NYRIAVAEDG IHVYNRDRHT TGTDAMAFFP ELSVESDGAH AFYLGGELTK
AETAFRLGKR YVQDEPLDWG CAADRTQEDT TAFKAAGPTK SAHTKHSGPE APAAERATRT
DPERDAAPPR TETGSSTASE RDPLSEPKGG RIVCGRLVPD EDRN
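
The sequences above are printed in blocks of ten characters, so the spaces and line breaks have to be removed before any analysis. The sketch below shows one way to do that with the Python standard library, along with a simple GC-content calculation and a check that a coding sequence ends on a stop codon. The function names (`clean_sequence`, `gc_percent`, `ends_with_stop`) are hypothetical helpers written for this illustration; the stop codons are those of NCBI translation table 11, the table listed in the record. Only the first and last rows of the gene sequence are used as sample input.

```python
# Stop codons in NCBI translation table 11 (bacterial, archaeal, plant plastid).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the length is a multiple of three and the last codon is a stop."""
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


# First and last rows of the gene sequence, copied from the record.
first_row = "GTGAGCGCGC CCGAACATCT CGTCTTCATC ACCGGCAAGC TGGCCCATGC CCGCCTCGAA"
last_row = "GAGGACCGGA ATTAG"

print(len(clean_sequence(first_row)))             # 60
print(ends_with_stop(clean_sequence(last_row)))   # True
print(round(gc_percent(clean_sequence(first_row)), 1))
```

Applying `clean_sequence` and `gc_percent` to the full gene sequence gives a value that can be compared with the "GC content" field of the record. Note that the gene begins with GTG, which table 11 permits as an alternative start codon; it is read as methionine at the start of translation, which is why the protein sequence begins with M.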