Gene Msil_2394 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsil_2394 
Symbol 
ID7093946 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylocella silvestris BL2 
KingdomBacteria 
Replicon accessionNC_011666 
Strand
Start bp2608086 
End bp2609480 
Gene Length1395 bp 
Protein Length464 aa 
Translation table11 
GC content63% 
IMG OID643465716 
Productdihydropteroate synthase DHPS 
Protein accessionYP_002362686 
Protein GI217978539 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0294] Dihydropteroate synthase and related enzymes 
TIGRFAM ID[TIGR00284] dihydropteroate synthase-related protein 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones49 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGAGC GCTTTTTGTT CGTCACCGGC CATCTCGCCT ATCCGCGCCT TGAGCGGATG 
ATGCGCTCGC TTGGGGACAC GCCGTTTGCC TGGGACATCG CCAATATCGG CGTCAAGGTT
GCGGCCCTGA TGACCGAGGC GATCTTGCTG CGGCGCCTGC CGTGTCCGCT CAACGCCGAT
CGCATCATCC TGCCCGGCCG GTTTCGCGGC GATCTTGCGG CGCTTTCAAA GGAGCTTGGC
GTGCCTGTCG TGCGCGGCCC CGACGAGATC AGCGATCTTC CCGTGTATCT CGGCCGCGGC
GGGCAGGAGC CTGACCTGTC GCGCTATGAC ATCCGCATCT TTGCGGAGAT CGTCGACGCC
TCCGCCCTAT CGGTCGAGGC GTTGCTGGCG CGGGCGGACA AACTGCGGCG CGCGGGCGCC
GATGTGATCG ATCTCGGCAG CCTGCCGGAT ACGCCTTTTC CGCATCTTGA GGACGCCGTG
CAGGCGCTCA AGGCGACGGG CGTTCGCGTC AGCGTCGATT CTTTCGATCA TGTCGAATTG
CGGCGCGGCG CGAAGGCCGG CGCGGATTTT CTTTTGAGCC TCGACGAGAA CTCTCTCGAC
ATCGCGGACG ACACTGGGAC GACGCCCATC CTCGTCGGCG CGCCGCTGCA TGACATCGAC
TCGCTGGCGC GCGCGGCGGA AGCCGCCCAA CGGCGCGGCC TGCCCTATAT CGTCGATCCG
ATTCTCGACC CGATCCATTT TGGTTTTTCC GACTCGATCG CCCGCTATGT CGAGACGCGG
CGCCGGTTGC CGGAAGCTGA AATGATGATG GGAACGGGCA ATCTGACGGA GTTGACGGAG
GTCGATTCCG GCGGCCTCAC CGGAGCGCTT CTCGGCATCT GCTCCGAGCT GAAAATCCGC
AATGTGCTCA CCGTTCAAGT GAGCCCGCAT ACGCGGCGTA CGATCGAGGA GCACGATGCG
GCGCGGCGCA TGATGTTTGC GGCGCGGGCC GATAATTCGC TGCCAAAAGG CTATGGGACG
GCGCTGTTGC AACTGCATGA CAAAACGCCC TTTGCCTCGA CGTCGGAGGA AATCGCCGAA
CTCGCCGGCG ATGTGCGCGA CAAGAATTTC CGCATCGCGA CAGCGGTCGA CGGCGTTCAT
ATCTACAATC GTGACGGTCA TCATATAGCG AAAGACGCCT TCTCCTTATT TCCCAAACTC
GGCGTCGAGG CGGACGGCCC GCACGCCTTT TATCTCGGCG CCGAACTCAC CAAAGCCGAG
ATCGCTTTTG CGCTTGGCAA ACGCTATGCG CAGGACGAGC TGATTGATTG GGGCGTCGGC
GCGGACCGGC CGGACGAGCA GCGCGATCGA CTGCGCGAGG CCGGGCATAC GCTGCGGCGC
AAGGAGGAGC CATGA
 
Protein sequence
MSERFLFVTG HLAYPRLERM MRSLGDTPFA WDIANIGVKV AALMTEAILL RRLPCPLNAD 
RIILPGRFRG DLAALSKELG VPVVRGPDEI SDLPVYLGRG GQEPDLSRYD IRIFAEIVDA
SALSVEALLA RADKLRRAGA DVIDLGSLPD TPFPHLEDAV QALKATGVRV SVDSFDHVEL
RRGAKAGADF LLSLDENSLD IADDTGTTPI LVGAPLHDID SLARAAEAAQ RRGLPYIVDP
ILDPIHFGFS DSIARYVETR RRLPEAEMMM GTGNLTELTE VDSGGLTGAL LGICSELKIR
NVLTVQVSPH TRRTIEEHDA ARRMMFAARA DNSLPKGYGT ALLQLHDKTP FASTSEEIAE
LAGDVRDKNF RIATAVDGVH IYNRDGHHIA KDAFSLFPKL GVEADGPHAF YLGAELTKAE
IAFALGKRYA QDELIDWGVG ADRPDEQRDR LREAGHTLRR KEEP