Gene M446_5017 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagM446_5017 
Symbol 
ID6132403 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium sp. 4-46 
KingdomBacteria 
Replicon accessionNC_010511 
Strand
Start bp5499936 
End bp5501351 
Gene Length1416 bp 
Protein Length471 aa 
Translation table11 
GC content73% 
IMG OID641645153 
Productdihydropteroate synthase DHPS 
Protein accessionYP_001771778 
Protein GI170743123 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0294] Dihydropteroate synthase and related enzymes 
TIGRFAM ID[TIGR00284] dihydropteroate synthase-related protein 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00770286 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACCGAGC ACCTCGTCTT CGTCACCGGG CGCCTCGCCA AGCCGCGGCT CGAATCGGTC 
GTCGCGGCGC TCCCGCGCGA GCGCTTCACC GGCACCATCG CGGATGCGGG CGTGAAGGTC
GCCGCGCTGA TGACCGAGGA GATCATCCGC CGCCGCGTCA CGCTGCCGGA GGGGGCCGAC
CGCGTCATCC TGCCGGGGCG CTGCCGGGCC GACCTCGCCG CGCTCTCCGC CCATTTCGGG
GTGCCGGTCG AGCGCGGGCC CGACGAGATC GTCGACCTGC CGGCCCATCT CGGCCTCGCC
GGCCGCAAGG TCGACCTGTC GCGGCACGAC CTCACCATCT TCTCGGAGAT CGTCGACGCC
TCCCGCATGA CGCCGGACGA GATCCTCGTC CGCGCCCGGG ACCTCGCCCG CCGGGGCGCC
GACGTGATCG ACCTCGGCGG CCTGCCGGAC ACGCCCTTCC CGCACCTGGA GGAGGCGGTG
CGGCTGCTGA AGGGGGCGGG CCTGCGGGTC AGCGTCGATT CCTTCGACCG CGAGGAACTC
GCCCGGGGCG CGCGGGCGGG CGCCGATTTC CTGCTGAGCC TCAACGAGGA CAGCCTCGAC
CTCGCCTTCG AGACCGACGC CGTGCCGGTG CTGGTGCCGG TGCGGCCCGA CGACCTCGAG
TCCCTCGACC GCGCGATCGC GCGGATGCGG GCGGCGGGAA AGCCCTTCCT GGCCGACCCG
ATCCTGGAGC CGATCCATTT CGGCTTCGCC GCCTCGATCG TGCGCTACCA CGAGACCCGC
CGCCGCCATC CCGACATCGA GATGATGATG GGGACCGGCA ACCTGACCGA ACTGACCGAG
GCGGACAGCG TCGGGGTGAC GGCGCTCCTG GTCGGCCTGT GCTCGGAACT GGCGATCCGC
AACGTGCTGA TCGTGCAGGT CTCGAACCAC ACCCGACGCA CGGTCGAGGA GCACGACGCC
GCCCGGCGGG TGATGTACGC GGCCCGCGCC GACGGGGCGC TGCCGAAGGG CTACGGCCGC
CAGCTCCTCG GACTCCACGA CAAGCGCCCC TACACGCAGA CGCCCGAGGA GATCGCGGCG
CTGGCCGCCG AGGTGCGCGA CCCGAACTAC CGCGTCGCGG TCGCGCAGGA CGGGGTCCAT
GTCTACAACC GGGCGATCCA CAAGGTCGGC ACCGACGCCA TGGCGTTCTT CCCCGACCTC
GACGTGGCGA CCGACGGCGG CCACGCCTTC TATCTCGGCG GGGAATTGAC CAAAGCCGAA
CTCGCCTGGC GGCTCGGCAA GCGCTACGTG CAGGACGAGC CCCTGGACTG GGGCTGCGCG
GCGGATGCGG CGGCGGAGGA CACCACCGCG TTCAAGGAGG TCGGCCACAC CCTGCACGGG
CGCCGGCCGG CCCGCGCGCC CGACGGGACG GAGTGA
 
Protein sequence
MTEHLVFVTG RLAKPRLESV VAALPRERFT GTIADAGVKV AALMTEEIIR RRVTLPEGAD 
RVILPGRCRA DLAALSAHFG VPVERGPDEI VDLPAHLGLA GRKVDLSRHD LTIFSEIVDA
SRMTPDEILV RARDLARRGA DVIDLGGLPD TPFPHLEEAV RLLKGAGLRV SVDSFDREEL
ARGARAGADF LLSLNEDSLD LAFETDAVPV LVPVRPDDLE SLDRAIARMR AAGKPFLADP
ILEPIHFGFA ASIVRYHETR RRHPDIEMMM GTGNLTELTE ADSVGVTALL VGLCSELAIR
NVLIVQVSNH TRRTVEEHDA ARRVMYAARA DGALPKGYGR QLLGLHDKRP YTQTPEEIAA
LAAEVRDPNY RVAVAQDGVH VYNRAIHKVG TDAMAFFPDL DVATDGGHAF YLGGELTKAE
LAWRLGKRYV QDEPLDWGCA ADAAAEDTTA FKEVGHTLHG RRPARAPDGT E