Gene Mext_3441 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_3441 
Symbol 
ID5833298 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp3819865 
End bp3821457 
Gene Length1593 bp 
Protein Length530 aa 
Translation table11 
GC content68% 
IMG OID641369240 
Productphytoene desaturase 
Protein accessionYP_001640898 
Protein GI163852855 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1233] Phytoene dehydrogenase and related proteins 
TIGRFAM ID[TIGR02734] phytoene desaturase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.826621 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value0.0554878 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCTGCAGT CGCGGGAAGG TTTGCGTACC CTGTTGGGCC GCCGCACGAT CGTCGTCGGG 
GCCGGCCCCG GCGGGCTTGC CACGGCGCTG CTCCTGGCCA AGGCCGGACT GCACGTCACG
CTGATCGAAA AGGACGCCCA AGTCGGCGGG CGCACCAAGA CCGTGGAGGC GCCGGGCGGC
TACCGCTTCG ACATCGGCCC GACCTTCTTC CTCTATCCGC AGATCCTCGC CGACATCTTC
GAATCCTGCG GCGAGCGCCT GGAAGACCAT GTCCGGCTGG AGCGGCTCGA TCCGCAATAC
AACCTCGTCT TCGGGGGTGA GGGCGGCATT TCGGGGCAGA TCCGCGCCAC CGGCGATGTG
CCGCGGCTGA AGGCCGAGAT CGCCCGCCTC GCCCCCGCCG ATGCCGAGAA CGTCGAGAAG
TTCTTCGAAG AGAACCGGAC CAAGCTGAAC TACTTCAAGC CGGTGCTGGA GCAGCCCTTC
GACAACATCC TGTCGATGGC GAGCCCGGCG ATGCTGGCCG CCCTGCCCCA TCTCCATCCC
GGCCGCAGCG TGGACCGGGA TCTCAAGCGC TACTTCGCCG ACCCGCGGGT GCGCCTCGCC
TTCTCGTTCC AGACCAAATA TCTCGGGATG AGCCCCTTCC GCTGCCCGAG CCTGTTCACG
ATCCTCTCGT TTCTTGAATA CGAGCACGGG GTCTACCACC CGGTCGGCGG CTGCGGCGCG
GTCTCGGAGG CGATGGCGGG GCTGGCCCGC CGCATGGGCG TCGACATCCG CCTCGGGCAA
TCGGTCGAGC GGATCCTGTT CGAGGGCAAG CGCGCCACCG GCGTGGTCGT CGGCGGCGAG
ACTCTGAAGG CCGATGCGGT GGTCGTGAAC GGCGACTTCG CCAAGGTGAT CCGCGATCTC
GTGCCGGAGG AGCGGCGCCC GCGCTGGCGC GACGCCAAGA TCGGCAAGGC GCGCCTCTCC
TGCTCGACCT ACATGCTCTA CCTCGGCATC GAGGGCAAAA TGCCCGAGAG CCTGGGCCAC
CACACCATCC TGCTGGCCAA GGAATACGAG CGCAACATCA AGGAGATCAC CGGCGGCACG
CTGCCGATGG AGCCTTCGAT CTACGTGCAG CATGCCGGCT TCACCGATGG CGGCATGGCG
CCGCCCGGCC ACACCGCCCT CTACGTGCTG GTGCCGGTGC CGAACCTGAA GGCGGGCATC
GATTGGGAGA CGGTCGGCCC GACCTACCGC AAGCTGATTC TGGAGCGCTT GAAGCTCCTC
GGCCTTCCCG ACATCGAGAG CCGCATCCGC TACGAGCGCG TGGTCGATCC CCGCGACTGG
CGCGACGAAT TCGCGGTGCA CGAGGGCGCG ACCTTCAACC TCGCCCACGA TCTCATGCAG
ATGCTGTGGT TCCGGCCGCA TAACCGCTTC GGGCCGGGGC TCTACCTCGT CGGCGGCGGC
ACCCATCCGG GCTCCGGTCT TCCGGTGATC TACGAGGGCG CGCGCATCTC GGCCCGGCTC
CTGATCGAGG ATCTCGCCAA GGAGAAGGCG CCCGCGATCC TGGCGGACCT GCCGACGACC
TCGCCGCTCG CCACGCAGGG CGATCCGAGC TGA
 
Protein sequence
MLQSREGLRT LLGRRTIVVG AGPGGLATAL LLAKAGLHVT LIEKDAQVGG RTKTVEAPGG 
YRFDIGPTFF LYPQILADIF ESCGERLEDH VRLERLDPQY NLVFGGEGGI SGQIRATGDV
PRLKAEIARL APADAENVEK FFEENRTKLN YFKPVLEQPF DNILSMASPA MLAALPHLHP
GRSVDRDLKR YFADPRVRLA FSFQTKYLGM SPFRCPSLFT ILSFLEYEHG VYHPVGGCGA
VSEAMAGLAR RMGVDIRLGQ SVERILFEGK RATGVVVGGE TLKADAVVVN GDFAKVIRDL
VPEERRPRWR DAKIGKARLS CSTYMLYLGI EGKMPESLGH HTILLAKEYE RNIKEITGGT
LPMEPSIYVQ HAGFTDGGMA PPGHTALYVL VPVPNLKAGI DWETVGPTYR KLILERLKLL
GLPDIESRIR YERVVDPRDW RDEFAVHEGA TFNLAHDLMQ MLWFRPHNRF GPGLYLVGGG
THPGSGLPVI YEGARISARL LIEDLAKEKA PAILADLPTT SPLATQGDPS