Gene Mpe_A1410 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A1410 
Symbol 
ID4783923 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp1520048 
End bp1521112 
Gene Length1065 bp 
Protein Length354 aa 
Translation table11 
GC content63% 
IMG OID640089976 
ProductNADH-ubiquinone oxidoreductase, chain H 
Protein accessionYP_001020607 
Protein GI124266603 
COG category[C] Energy production and conversion 
COG ID[COG1005] NADH:ubiquinone oxidoreductase subunit 1 (chain H) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0192046 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCGAGA GCTTCAACCA GTTCGGCAGC AGCCTGCTCG GCGGCTTCTG GCCCGTCGTG 
TGGAACCTGA TCAAGATCGT GGCACTGATC GCGCCGCTGA TGGGTTGCGT CGCCTACCTC
ACGCTGTGGG AACGCAAGGC CATCGGCTGG ACGCAGATCC GTCCCGGCCC CAACCGCGTC
GGCCCCTGGG GCCTGCTCAC GCCGATCGCC GATGCGGTCA AGCTGATCTT CAAGGAAATC
ATCCTGCCGA CGGCGGCCAA CAAGGGCCTG TTCCTGCTCG GCCCCGTGAT GACCATCATG
CCGGCGCTTG CCGCCTGGGT GGTCGTGCCG TTCGGCCCGG AAGTGGCGCT GGCCAACATC
AACGCCGGCC TGCTGTTCCT GATGGCGATC ACCTCGATGG AGGTCTATGG CGTGATCATC
GCCGGCTGGG CATCGAACTC GAAGTACGCC TTCCTCGGCG CGCTGCGCGC CTCGGCGCAG
ATGGTCAGCT ACGAGATCGC GATGGGCTTC GCGCTGGTGG TGGTGCTGAT GGTGTCGGGC
ACCCTGAACA TGACCGAGAT CGTGCTCGGG CAGGACAGGG GGCGCTTCGC CGACATGGGG
CTCAACTTCC TCAGCTGGAA CTGGCTGCCG CTGTTCCCCA TCTTCATCGT CTACTTCATC
TCCGGCCTCG CCGAGACCAA CCGCCACCCC TTCGACGTGG TGGAAGGCGA GTCCGAGATC
GTGGCCGGTC ACATGATCGA GTACTCGGGC ATGGCCTTCG CCATGTTCTT CCTGGCCGAG
TACGCCAACA TGATCCTGAT CTCGGCGCTT GCCGTGACCA TGTTCCTGGG GGGCTGGCTG
CCGCCGATCG ACAGCGTCGT CTTCAACTGG ATTCCGGGTT GGATCTGGCT GGGCCTCAAG
ACCTTCGTGG TCGTGACCAT GTTCCTGTGG GTGCGCTCCA CGTTCCCGCG CTTTCGCTAC
GACCAGATCA TGCGGCTGGG CTGGAAGATC TTCATCCCGA TCACGTTGAT CTGGCTGGTC
GTCGTGGGGC TGTGGATCCA GTCGCCGTGG AACATCTGGA AATAA
 
Protein sequence
MIESFNQFGS SLLGGFWPVV WNLIKIVALI APLMGCVAYL TLWERKAIGW TQIRPGPNRV 
GPWGLLTPIA DAVKLIFKEI ILPTAANKGL FLLGPVMTIM PALAAWVVVP FGPEVALANI
NAGLLFLMAI TSMEVYGVII AGWASNSKYA FLGALRASAQ MVSYEIAMGF ALVVVLMVSG
TLNMTEIVLG QDRGRFADMG LNFLSWNWLP LFPIFIVYFI SGLAETNRHP FDVVEGESEI
VAGHMIEYSG MAFAMFFLAE YANMILISAL AVTMFLGGWL PPIDSVVFNW IPGWIWLGLK
TFVVVTMFLW VRSTFPRFRY DQIMRLGWKI FIPITLIWLV VVGLWIQSPW NIWK