Gene Mext_4444 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_4444 
Symbol 
ID5834107 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp4949357 
End bp4950682 
Gene Length1326 bp 
Protein Length441 aa 
Translation table11 
GC content70% 
IMG OID641370237 
ProductFolC bifunctional protein 
Protein accessionYP_001641883 
Protein GI163853840 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0285] Folylpolyglutamate synthase 
TIGRFAM ID[TIGR01499] folylpolyglutamate synthase/dihydrofolate synthase 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value0.545059 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAATCCT CCGACGCGTT GATGGCGCGC TTCCTCGCCC TGCATCCGCG CACGATCGAC 
CTGTCGCTCG GGCGCATTCA GCGCCTGCTC GCGGCCCTGA ATCATCCCGA GCGGCGGCTG
CCGCCGGTGA TTCACGTCGC CGGCACCAAC GGCAAGGGCT CGACCATCGC CTTTATGCGG
GCAATCCTGG AGGCGGGGGG CCTGGCTGCC CACGTCTACA CCTCGCCCCA CCTCGTGCGC
TTCCATGAGC GCATCCGCTT AGGCGGCATT GGCGGCGGTC ACTACGTCGC CGAAGACCGG
CTCGCCGATG CCTTCGCCCG CTGCGAGGCG GCCAACAAGG GCGATCCGAT CACCGTGTTC
GAGATCACCA CCGCGGCCGC CCTGCTGCTG TTTTCCGAAT GCCCCGCCGA CGTGCTGCTG
CTGGAAGTGG GCCTCGGCGG CCGGGTCGAT GCCACCAACG TCATCGACCA CCCGGCCTGC
GCCGTGGTCA CCCCGATCGG GCGCGACCAT GCCGAATATC TCGGCGACAC CGTCGAGGCG
GTGGCGATGG AGAAGGCCGG CATCTTCAAG CGCGGCTGCC CGGCGGTGAT CGCCGCCCAG
GATTATGCCG GGGCCGACGC CGTCCTCTGC CGCCAAGCCG AGCGCGTCGG CGCGGTGCCG
GTGCGGATCG GCAACCAGGA CTTCTCCGTA CACGAGGAGA GCGGGCGCTT CGTCTACCAG
GACGAGATCG ACCTGTTCGA TCTGCCGCGC CCCCGCCTCG CAGGGCGCCA CCAGCTCACC
AATGCCGGCA CTGCCATCGC GGCCCTGCGC GCGGCGGGCT TCGGCGATAT CGGCACGGTC
GCCCTCGAAG CCGGTCTGCG CAACGTCGAT TGGCCGGGCC GGCTCCAGCG CCTCGTGCGC
GGGGCGCTCG CCGAGCGGAT GCCCAAGGAC GCCGAGCTGT GGCTCGACGG CGGCCACAAT
GCCGATGGCG GGCGCATCCT CGCCGCCGCC ATGGCCGATC TGGGCGAGCG CAGCGACGTG
CCGCTGGTCC TGATCGTCGG GCTGCTCGGC ACCAAGGATG CCGAAGGCTT CCTGAAGAAC
TTCGTCGGCC TTGCCCGCTC GCTGGTAGCG GTGCCGATCA CCGGCCAGAT GGCCGCGCGG
CCCGCCGAGG AAGTGGCGGA AATCGCCCGT GAGGTCGGTC TCTCGGCCGA GGTCGCTCCG
AGCGTCGAGG CGGCGTTGGC GGCCCTGTCG GACACGGTCT TCGAGCGCCC GCCGCGGGTC
CTCATCTGCG GTTCGCTTTA TCTCGCGGGC GCCGTGCTCG AAGCCAACGG CACGATCCCG
GTCTGA
 
Protein sequence
MESSDALMAR FLALHPRTID LSLGRIQRLL AALNHPERRL PPVIHVAGTN GKGSTIAFMR 
AILEAGGLAA HVYTSPHLVR FHERIRLGGI GGGHYVAEDR LADAFARCEA ANKGDPITVF
EITTAAALLL FSECPADVLL LEVGLGGRVD ATNVIDHPAC AVVTPIGRDH AEYLGDTVEA
VAMEKAGIFK RGCPAVIAAQ DYAGADAVLC RQAERVGAVP VRIGNQDFSV HEESGRFVYQ
DEIDLFDLPR PRLAGRHQLT NAGTAIAALR AAGFGDIGTV ALEAGLRNVD WPGRLQRLVR
GALAERMPKD AELWLDGGHN ADGGRILAAA MADLGERSDV PLVLIVGLLG TKDAEGFLKN
FVGLARSLVA VPITGQMAAR PAEEVAEIAR EVGLSAEVAP SVEAALAALS DTVFERPPRV
LICGSLYLAG AVLEANGTIP V