Gene Mext_1080 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_1080 
Symbol 
ID5832768 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp1177831 
End bp1179132 
Gene Length1302 bp 
Protein Length433 aa 
Translation table11 
GC content67% 
IMG OID641366874 
ProductNADH dehydrogenase I subunit F 
Protein accessionYP_001638555 
Protein GI163850512 
COG category[C] Energy production and conversion 
COG ID[COG1894] NADH:ubiquinone oxidoreductase, NADH-binding (51 kD) subunit 
TIGRFAM ID[TIGR01959] NADH-quinone oxidoreductase, F subunit 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value0.582859 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCGCCG ATCAGGATCG CATCTTCACC AATCTCTACG GCCTGCACTC GCCGGGGCTT 
GAGGCCGCGA AGAAGCGCGG CGCCTGGGAC GGAACCAAGT TCCTCCTCGA CATGGGCCGT
GACTGGATCA TCGACGAGAT GAAGGGCTCC GGCCTGCGCG GCCGTGGTGG CGCGGGCTTT
CCCACCGGCC TCAAATGGTC GTTCATGCCC AAGAAGTCCG ACGGGCGCCC GCACTACCTC
GTCGTCAACG CCGACGAATC GGAGCCGGGC ACCTGCAAGG ACCGGGAGAT CATGCGGCAC
GATCCGCATC TCCTGATCGA GGGCTGCCTG CTGGCCTCCT TCGCCATGGG GGCGCATGCC
TGCTACATCT ACATCCGCGG CGAGTACGTG GCGGAGAAGT TCGCCCTTCA GCGCGCGGTG
GACGAGGCCT ACGAGGCGCG CCTCGTCGGG CCGTCGAACA TCCACGACTA CCCGTTCGAC
ATCTACGTCC ACCACGGCGC GGGCGCTTAC ATCTGCGGCG AGGAAACGGC GCTGATCGAG
AGCCTGGAAG GCAAGAAGGG GATGCCGCGG CTGAAGCCGC CATTCCCCGC CAATATGGGC
CTCTATGGCT GCCCCACGAC CGTCAATAAC GTCGAATCGA TCGCGGTGGC CGGCACGATC
CTGCGCCGCG GCGGCGCGTG GTTCGCCGGC CTCGGCGGTA AGAACAACAC CGGCACCAAG
CTGTTCTGCG TCTCGGGCCA CGTCAACAAG CCCTGCAACG TCGAGGAAGA GCTCGGCATC
ACCTTCCGCG AGCTGATCGA TAAGCATTGC GGCGGCATGC GCGGCGGCTG GGACAATCTG
CTCTGCTCCA TCCCCGGCGG CTCCTCGGTG CCGCTGGTGC CGGCCGAGCA GATCATCGAC
GCCAAGATGG ACTTCGACAC CCTGCGCAAC CTCGGCTCGG GGCTGGGCAC CGCGGCGGTG
ATCGTGCTCG ACAAATCGAC CGACATCGTC GGCGCGATCG CCCGCATCTC GTACTTCTAC
AAGCACGAGA GCTGCGGCCA GTGCACGCCC TGCCGCGAGG GCACCGGCTG GATGTGGCGC
GTGCTGACCC GCATGGCTGC CGGCCGGGCG CAGAAGCGCG AGATCGACAT GCTCCTGGAA
GTCACCAAGC AGGTCGAGGG CCACACGATC TGCGCGCTGG GCGACGCCGC GGCATGGCCG
ATCCAGGGCC TGATCCGGCA CTTCCGCCCC GAGATTGAGA AGCGGATCGA CCAGTACAGC
GCCAACCCGC ACATGGATGC GGTGCCGATG GCGGCGGAGT GA
 
Protein sequence
MLADQDRIFT NLYGLHSPGL EAAKKRGAWD GTKFLLDMGR DWIIDEMKGS GLRGRGGAGF 
PTGLKWSFMP KKSDGRPHYL VVNADESEPG TCKDREIMRH DPHLLIEGCL LASFAMGAHA
CYIYIRGEYV AEKFALQRAV DEAYEARLVG PSNIHDYPFD IYVHHGAGAY ICGEETALIE
SLEGKKGMPR LKPPFPANMG LYGCPTTVNN VESIAVAGTI LRRGGAWFAG LGGKNNTGTK
LFCVSGHVNK PCNVEEELGI TFRELIDKHC GGMRGGWDNL LCSIPGGSSV PLVPAEQIID
AKMDFDTLRN LGSGLGTAAV IVLDKSTDIV GAIARISYFY KHESCGQCTP CREGTGWMWR
VLTRMAAGRA QKREIDMLLE VTKQVEGHTI CALGDAAAWP IQGLIRHFRP EIEKRIDQYS
ANPHMDAVPM AAE