Gene Mext_1456 details


Gene Information

Locus tag: Mext_1456
Symbol:
ID: 5835791
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium extorquens PA1
Kingdom: Bacteria
Replicon accession: NC_010172
Strand:
Start bp: 1630332
End bp: 1632422
Gene Length: 2091 bp
Protein Length: 696 aa
Translation table: 11
GC content: 71%
IMG OID: 641367256
Product: hypothetical protein
Protein accession: YP_001638928
Protein GI: 163850885
COG category: [S] Function unknown
COG ID: [COG5426] Uncharacterized membrane protein
TIGRFAM ID:
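
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon; both assumptions are consistent with the listed values but are not stated in the record. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1630332, 1632422)
print(length)                  # 2091
print(protein_length(length))  # 696
```

Both results agree with the Gene Length and Protein Length fields in the record.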


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.113684
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value: 0.249298
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTGAGCC TCAGCTTCAC GCCCCTCGTC CCCTGGCCGG TGCTCGTCGG ATTCGGCGTG 
CTCGCCGTGA TCCTCGCCGT CGTCGCGGTC CTCGCGCGTG GGCGCACGGC CCTGTTGCGG
GCGGTGGCGC TCGGCCTCGT CATCGCGGCG CTCGCCAATC CCTCGCTCGT GCGCGAGGAC
CGCGACCCGG TGAAGGACGT GGCGGCCATC GTCGTCGATC GCTCCGGCTC GCAAAGCTTG
GGCGATCGCC CGGCCATGAC CGACGCGGTG AAAGCCGAGC TGGAGCGCCG CTTCGGCGCG
CTGGCGAATA TCGAGCCCCG CTTCATCGAG GTCGGCGACG CCCAGGGCGG GGAGGGCGAC
GACGGCACCA AGCTGTTCAC GGCGCTGACC CAGGCGCTCG CCGACGTGCC GCCGGAGCGG
ATCGCCGGCG TGGTGATGCT CACCGACGGC GTGGTGCACG ACATCCCCGC CTCGCTCGAA
GCGCTTGGCC TCAAGGCGCC GCTGCACGTT CTCGTCACAG GACGGCCGGA CGAGCGCGAC
CGTCAGATCA AGATGCTGGA GGCGCCCCGC TTCGGCATCG TCGGCCGGGA CGTCACCTTG
CGCGGCGAGG TGATGGAGCG CGGCGGCACC GGCACGGCGA CTGTCACCGT GCGCCGCGAC
GGCGAGGAGA TCGACCGCCA AAGCATCGCC ACCGGCGTGC CGTTCTCGCT CACCACGCAT
ATCGAGCATG GCGGACCGAA CGTCGTCGAG ATCGAGGTCG AGCCGCTGCC GGGCGAGCTG
ACCACGGTCA ACAACCGCGC CGTCCTGCCC ATCGAAGGAA TTCGCGAAAA ACTGCGGGTG
CTGCTCGTCT CAGGCGAGCC GCATCAGGGC GAGCGCACTT GGCGCAACCT CCTGAAGTCG
GACGCCTCCG TCGATCTCGT CCACTTCACG ATTCTCAGGC CGCCGGAGAA GCAGGACGGC
ACCCCGATCT CCGAACTCTC GCTGATCGCC TTCCCGACCC GCGAGCTGTT CGTCCAAAAG
ATCAAGGATT TCGACCTCAT CATCTTCGAC CGCTACGCCA ACCAGAGCGT GCTGCCGTCC
GCCTATTTCG ACAACATCGC CCGCTACGTC CGCGAGGGCG GTGCGCTCCT CATCGCCGCA
GGGCCGGAAT TCGCCGGTCC CGCCAGCCTT GCCCGCACCC GGCTCGCCTC GATCCTGCCG
GGCGATCCCT CGATGAAGGT GGTCGAGCAG CCGTTCAAGG CGACGCTCAC CGAGACCGGC
CACCGCCACC CCGTCACCCG GGCGCTGCCG GGCTCGGAGG CGAACCCGCC GGCCTGGGGC
GACTGGCTGC GCATCGTCTC GGCCCAGACC CGGCCGGGCG TGCAGCCGAT CCTGTCGGGC
GCCAACGGCC TGCCGCTCCT GGCACTCTCG CGTGAGGAGA AGGGCCGCGT CGCCCTGATG
CTGTCCGATC AGGCTTGGCT CTGGGCCCGC GGCTACCAGC AGGGCGGCCC CTATCTCGAT
CTGCTGCGGC GCCTCGCCCA CTGGCTGATG AAGGAGCCCG CGCTCGAAGA GGAGGCCCTG
CGTGCCCAGA CCACGGGCCG GGGCCGCGAG GTCCGCGTCG AGCGCCAGAC CATGGCCGAG
GAGGCGGGCC CCGTCACGAT CACCGGTCCG ACCGGCAAGG AGCGCAGTCT CGACCTGTCG
AAGGCCGAGA CCGGCCTGTT TACCGCGACC TTCGAGGCGG AGGGGCTCGG GCTCCATACC
ATCCGCTCGG GCAACCTCGT CGCCTTCGTC AGCGTCGGCC CGGCCAACCC GCGCGAACTC
GCCGACGTGT TCAGCGACAC CGACCGGCTG AAGGGCGTGG CGGACGGCTC CGGCGGCACG
GTGCGCCGGG TCGCGGCGGC CGGCGGCGGC ATCGAAGTGC CCCGCCTCCA GATCACCCGC
GGCGGCCGTC TGGGCGGCGC CGACTGGATC GGCTTCCGCC CGAGCGACAG CGCCACGATA
CGCGGCGTCG AGGTCTATCC CCTCGGCATC GGCCTGTGGG CGCTGGTGGC CCTCGCCGCC
GCCGTGCTGG CGATGTGGCT GGTCGAGGGA CGGCGCGGAC GCGCGGCGTA G
 
Protein sequence
MLSLSFTPLV PWPVLVGFGV LAVILAVVAV LARGRTALLR AVALGLVIAA LANPSLVRED 
RDPVKDVAAI VVDRSGSQSL GDRPAMTDAV KAELERRFGA LANIEPRFIE VGDAQGGEGD
DGTKLFTALT QALADVPPER IAGVVMLTDG VVHDIPASLE ALGLKAPLHV LVTGRPDERD
RQIKMLEAPR FGIVGRDVTL RGEVMERGGT GTATVTVRRD GEEIDRQSIA TGVPFSLTTH
IEHGGPNVVE IEVEPLPGEL TTVNNRAVLP IEGIREKLRV LLVSGEPHQG ERTWRNLLKS
DASVDLVHFT ILRPPEKQDG TPISELSLIA FPTRELFVQK IKDFDLIIFD RYANQSVLPS
AYFDNIARYV REGGALLIAA GPEFAGPASL ARTRLASILP GDPSMKVVEQ PFKATLTETG
HRHPVTRALP GSEANPPAWG DWLRIVSAQT RPGVQPILSG ANGLPLLALS REEKGRVALM
LSDQAWLWAR GYQQGGPYLD LLRRLAHWLM KEPALEEEAL RAQTTGRGRE VRVERQTMAE
EAGPVTITGP TGKERSLDLS KAETGLFTAT FEAEGLGLHT IRSGNLVAFV SVGPANPREL
ADVFSDTDRL KGVADGSGGT VRRVAAAGGG IEVPRLQITR GGRLGGADWI GFRPSDSATI
RGVEVYPLGI GLWALVALAA AVLAMWLVEG RRGRAA
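
The relationship between the gene sequence and the protein sequence can be illustrated with a short script. This is a minimal sketch, not the database's own pipeline: it strips the 10-base spacing used in the listing, computes the GC fraction, and translates codons until the first stop. The codon-to-amino-acid mapping used here is the standard one, which translation table 11 shares; table 11 differs in the set of permitted start codons, which does not matter for this gene because it begins with ATG. Only the first 60 bases of the gene are included for brevity, and the helper names (`clean`, `gc_fraction`, `translate`) are hypothetical.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G); '*' marks a stop.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence above (60 bases).
FIRST_BLOCK = clean(
    "ATGCTGAGCC TCAGCTTCAC GCCCCTCGTC CCCTGGCCGG TGCTCGTCGG ATTCGGCGTG"
)

print(translate(FIRST_BLOCK))  # MLSLSFTPLVPWPVLVGFGV
print(f"{gc_fraction(FIRST_BLOCK):.0%}")
```

The translated prefix matches the first 20 residues of the protein sequence. Passing the full cleaned gene sequence to `gc_fraction` and `translate` would allow the same comparison against the listed GC content and the complete protein.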