Gene Mpe_A3103 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A3103 
Symbol 
ID4786676 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp3298506 
End bp3303245 
Gene Length4740 bp 
Protein Length1579 aa 
Translation table11 
GC content66% 
IMG OID640091674 
Productglutamate synthase (NADH) large subunit 
Protein accessionYP_001022291 
Protein GI124268287 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0069] Glutamate synthase domain 2 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.456903 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCTCGT CCGCCCCAAG CCAGCAAGAC ATCGACACCC TTCGCCTCGA AGGCCTCTAC 
GATCCGGCCC ATGAACACGA TGCCTGCGGC GTCGGCTTCG TGGCGCACAT CAAGGGCCTC
AAGGCACACA GCATCGTCGG CCAGGGGCTG AAGATCCTGG AGAACCTCGA TCACCGTGGC
GCGGTCGGGG CTGACAAGCT GATGGGTGAC GGCGCTGGGA TCCTGATCCA GATTCCCGAC
GAGTACTACC GCGCCGAGAT GGCTCTCCAG GGCATCGAAC TGCCGCCGCC CGGCGAGTAC
GGTGTCGGCA TGATTTTCCT GCCCAAGGAA CACGCCTCGC GACTGGCCTG CGAGCAGGCG
CTGGAGCGCG CCGTGCACGC CGAAGGGCAG GTGCTGCTGG GCTGGCGCGA CGTGCCGGTC
GACCGGGCCA TGCCGATGTC GCCCACGGTG CGGGCCAAGG AGCCGGTGAT CCGTCAGATC
TTCATCGGCC GTGGCGCCGA CGTGATCGTG CCCGATGCGC TGGAGCGCAA GCTCTACGTG
ATCCGCAAGA CGGCATCGGC GGCGATCCAG AAGTTGCAGC TCACGCACAG CCACGAGTAC
TACGTGCCCA GCATGAGCTG CCGCACCATC ATCTATAAGG GTTTGCTGCT TGCCGATCAG
GTGGGCACTT ACTACCTGGA TCTGCAAGAC GAACGCACGG TGTCGGCGCT TGCGCTGGTG
CACCAACGCT TCTCGACCAA CACCTTCCCG GAATGGCCGC TGGCGCACCC CTACCGGATG
GTGGCGCACA ACGGCGAGAT CAACACCGTC AAGGGCAACT TCAACTGGAT GCGCGCGCGC
CAGGGCGTGA TGAAGTCGCC GGTGCTGGGG GATGACCTGA ACAAGCTGTA TCCGATCAGC
TTCGATGGCC AGTCCGACAC CGCGACCTTC GACAACGCGC TGGAATTGCT GACGATGGCC
GGCTACCCGC TGGCTCACGC CGCGATGATG ATGATCCCGG AGGCGTGGGA GCAGCATGCG
ACGATGGACG AGCGGCGCCG TGCCTTCTAC GAATACCACG CTGCAATGCT GGAGCCGTGG
GACGGTCCCG CCGCGATGGT GTTCACCGAT GGCAAGCAGA TCGGCGCCAC CCTCGACCGC
AACGGCCTGC GTCCGGCCCG CTACATCGTC ACTGACGACG ATCTGGTGGT GATGGCCTCC
GAGTCCGGCG TGCTGCCGAT CCCGGAGAAC AAGATCGTCA AGAAGTGGCG CCTGCAGCCC
GGCAAGATGT TTCTGATCGA CCTGGAACAG GGCCGCATCG TCGACGACGA GGAACTGAAG
AACACCTACG CCTCGGCCAA GCCCTACCGG CAGTGGATCG AGAACGTGCG CATCAAGCTC
GACGCCATCG CCGCGCCGGC CGAGGCCGCG CCGGCCTTTG CCGAAAACCT GCTCGATCGC
CAGCAGGCTT TCGGTTACAC GCAGGAAGAC ATCAAGTTCC TGATGAGTCC GATGGCCCAG
ACCGGGGAAG AGGGCATCGG ATCGATGGGC AACGACTCGC CGCTGGCCGT GCTGTCCGAC
AAGAACAAGC CGCTGTACAG CTACTTCAAG CAGATGTTCG CGCAGGTCAC CAACCCGCCG
ATCGATCCGA TCCGCGAGGC GATCGTGATG TCGCTGAACA GCTTCATCGG CCCCAAGCCC
AACCTGCTGG ACATCAATGC GGTCAACCCG CCGATGCGGC TGGAGGTCGC GCAACCCATC
CTCGACTTCG AGGACATGGC GCGCCTGCGT AGCATCGAGA AGCCGACGCA CGGCAAATTC
AAGTCGTATG AGCTCAACAT CGTCTATCCG TACGCCTGGG GCGGTGAGGG CGTGGAGGCC
AAGCTGGCAT CGCTGTGCGC CGAGGCGGTG GACGCGATCC AGGGCGGCCA CAACATCCTG
ATCATCACCG ACCGTCGCAT GAGCCGCGAC CAGATCGCGA TCCCCGCGCT GCTGGCCCTG
TCTGCGATCC ACCAGCACCT GGTTCGCGAG GGCCTGCGCA CCACCGCCGG CCTGGTGGTC
GAGACCGGCT CGGCGCGTGA GGTGCATCAT TTCGGCGTGC TTGCCGGCTA CGGCGCCGAG
GCCGTGCACC CCTACCTCGC GATGGAGACG CTGGCCTCGC TGCACAAGGA GTTGCCAGGC
GACCTGTCGG CCGACAAGGC GATCTACAAC TACGTGAAGG CGATCGGCAA GGGCTTGTCG
AAGATCATGT CCAAGATGGG CGTGTCGACC TACATGTCCT ACTGCGGTGC GCAACTGTTC
GAGGCGATCG GCCTCAACAA GCCGCTGGTC GAGAAGTACT TCCGCGGCAC GGCCTCGCAA
GTCGGCGGCA TCGGCGTGTT CGAGGTTGCC GAAGAGGCGT TGCGCATGCA CAGGGCAGCT
TTCGGCGACG ACCCGGTCCT GGCGACGATG CTCGACGCTG GTGGCGAGTA CGCCTGGCGT
ACACGCGGCG AGGAGCACAT GTGGACGCCG GACGCCATCG CGAAGCTGCA ACACAGCACG
CGCGCGAACA AGTTCGACAC CTACAAGGAA TACGCCCAGC TGATCAATGA CCAGTCGCGC
CGGCACATGA CGCTGCGTGG GCTGTTCGAG TTCAAGCTCG ACCCCAGCAA GGCCATCCCG
CTCGATCAGG TCGAACCGGC CAGCGAGATC GTCAAGCGTT TCGCCACCGG CGCCATGTCG
CTCGGCTCGA TCAGCACCGA GGCGCACAGC ACCCTCGCGA TCGCGATGAA CCGCATCGGC
GGCAAGTCCA ACACCGGCGA GGGCGGCGAG GACCCTGCGC GCTACCGCAA CGAGCTCAAG
GGCATCCCGA TCAAGCAGGG CACGATGGTC AGCGAAGTGG TCGGCAGCAA GGTGATCGAG
GCCGACTACG AGCTGAAGGA CGGCGATTCG CTGCGCTCGA AGATCAAGCA GGTGGCGTCG
GGGCGCTTCG GCGTGACGAC CGAGTACCTG GTCTCGGCCG ACCAGATCCA GATCAAGATG
GCCCAGGGCG CGAAGCCCGG CGAGGGCGGC CAGTTGCCGG GTGGCAAGGT GTCCGAGTAC
ATCGGCATGT TGCGCTACTC GGTGCCGGGC GTCGGCCTGA TCTCGCCGCC GCCGCACCAC
GACATCTACT CGATCGAGGA CCTTGCGCAG CTGATCCACG ACCTGAAGAA CGCCAACCCG
CGCGCCTCCA TCAGCGTCAA GCTGGTGTCG GAGGTGGGTG TCGGCACGAT CGCCGCCGGC
GTCGCCAAGG CCAAGAGCGA CCACGTGGTG ATCGCCGGTC ACGACGGCGG TACCGGGGCC
TCGCCGTGGT CCAGCATCAA GCACGCCGGC ACGCCCTGGG AGCTTGGCCT GGCCGAGACC
CAGCAGACGC TGGTGCTCAA CGGTCTTCGT GGCCGCATCC GCGTGCAGGC CGACGGCCAG
ATGAAGACCG GCCGCGACGT CGTGATCGGT GCGCTGCTCG GGGCCGACGA GTTCGGCTTC
GCGACCGCAC CGCTGGTGGT CGAGGGCTGC ATCATGATGC GCAAGTGCCA CCTCAACACC
TGTCCGGTGG GCGTGGCAAC GCAGGATCCG GTGCTGCGCG CCAAGTTCCA GGGCAAGCCC
GAGCACGTCG TCAACTACTT CTTCTTCGTT GCCGAGGAAG CGCGCCAGAT CATGGCGCAA
CTGGGCATCC GCAGCTTCGA CGAACTGGTC GGTCGCGCCG ATCTGCTCGA CACCAAGAAG
GGCGTGTCGC ACTGGAAGGC GCGCGGCCTC GATTTCGGGC GTGTGTTCCA CCTGCCGCAG
GTGGGCGCGG ACGTGCCGCG TCGCCAGGTC GACGTGCAGG ATCACGGCCT GGCCAAGGCG
CTCGACGTGC GCCTGATCGA GAAGTGCCGT CCAGCCATCG AGCGTGGCGA GAAGGTCCAG
TTCATGGACG AGACGCGCAA CGTCAACCGC ACCGTCGGCG CCATGCTGTC GGGCGAATTG
ATCCGCCACC GGCCCGAAGG CCTGCCCGAC CACACCATCT TCATGCAGAT GGAAGGCGTG
GGCGGCCAGA GCTTCGGCGC CTTCCTGGCG CAGGGCATCA CGCTCTACCT GATCGGCGAT
GCGAACGACT ACACCGGCAA GGGTCTGAGC GGCGGCCGCG TGGTGGTGCG CCCGAGCATC
GACTTCCGAG GCGACGCCAC GCAGAACATC ATCGTCGGCA ACACGGTGCT CTACGGGGCC
ACCAGCGGCG AGGCCTTCTT CCGCGGCGTG GCCGGCGAAC GCTTCGCGGT GCGACTGTCG
GGCGCGACCA CGGTGGTCGA AGGAACCGGT GACCACGGCT GCGAATACAT GACCGGCGGC
ACGGTGGTCG TGCTCGGCAA GACCGGGCGC AACTTTGCGG CCGGCATGTC GGGCGGCATC
GCCTACGTCT ACGACGAAGA CGGCAGCTTC TCGCAGCGTT GCAACACCGC GATGGTCGCG
ATGGACAAGG TACTCACGGC CGACGAGCAG CGCAGTACCC AGGAAGCCGC CATCTTCCAC
AAGGGTGTGG CCGACGAGGT GCTGCTGCGC AAGCTGATCG AGGACCACCA CCGCTGGACC
GGCTCGCTGC GCGCGCGCGA CATCCTCGAC CACTGGCCGG CGGCACGCGG CAAGTTCGTC
AAGGTCTTCC CGCACGAGTA CAAGCGGGCG CTGGGCGAGA TCCACGCCAA GAAGGAGGCC
AGCGAGACCA TCGCCAAGGC CAGGACCAAC GACAAGAAGT CGAGCAAGGC CAAGGCCTGA
 
Protein sequence
MTSSAPSQQD IDTLRLEGLY DPAHEHDACG VGFVAHIKGL KAHSIVGQGL KILENLDHRG 
AVGADKLMGD GAGILIQIPD EYYRAEMALQ GIELPPPGEY GVGMIFLPKE HASRLACEQA
LERAVHAEGQ VLLGWRDVPV DRAMPMSPTV RAKEPVIRQI FIGRGADVIV PDALERKLYV
IRKTASAAIQ KLQLTHSHEY YVPSMSCRTI IYKGLLLADQ VGTYYLDLQD ERTVSALALV
HQRFSTNTFP EWPLAHPYRM VAHNGEINTV KGNFNWMRAR QGVMKSPVLG DDLNKLYPIS
FDGQSDTATF DNALELLTMA GYPLAHAAMM MIPEAWEQHA TMDERRRAFY EYHAAMLEPW
DGPAAMVFTD GKQIGATLDR NGLRPARYIV TDDDLVVMAS ESGVLPIPEN KIVKKWRLQP
GKMFLIDLEQ GRIVDDEELK NTYASAKPYR QWIENVRIKL DAIAAPAEAA PAFAENLLDR
QQAFGYTQED IKFLMSPMAQ TGEEGIGSMG NDSPLAVLSD KNKPLYSYFK QMFAQVTNPP
IDPIREAIVM SLNSFIGPKP NLLDINAVNP PMRLEVAQPI LDFEDMARLR SIEKPTHGKF
KSYELNIVYP YAWGGEGVEA KLASLCAEAV DAIQGGHNIL IITDRRMSRD QIAIPALLAL
SAIHQHLVRE GLRTTAGLVV ETGSAREVHH FGVLAGYGAE AVHPYLAMET LASLHKELPG
DLSADKAIYN YVKAIGKGLS KIMSKMGVST YMSYCGAQLF EAIGLNKPLV EKYFRGTASQ
VGGIGVFEVA EEALRMHRAA FGDDPVLATM LDAGGEYAWR TRGEEHMWTP DAIAKLQHST
RANKFDTYKE YAQLINDQSR RHMTLRGLFE FKLDPSKAIP LDQVEPASEI VKRFATGAMS
LGSISTEAHS TLAIAMNRIG GKSNTGEGGE DPARYRNELK GIPIKQGTMV SEVVGSKVIE
ADYELKDGDS LRSKIKQVAS GRFGVTTEYL VSADQIQIKM AQGAKPGEGG QLPGGKVSEY
IGMLRYSVPG VGLISPPPHH DIYSIEDLAQ LIHDLKNANP RASISVKLVS EVGVGTIAAG
VAKAKSDHVV IAGHDGGTGA SPWSSIKHAG TPWELGLAET QQTLVLNGLR GRIRVQADGQ
MKTGRDVVIG ALLGADEFGF ATAPLVVEGC IMMRKCHLNT CPVGVATQDP VLRAKFQGKP
EHVVNYFFFV AEEARQIMAQ LGIRSFDELV GRADLLDTKK GVSHWKARGL DFGRVFHLPQ
VGADVPRRQV DVQDHGLAKA LDVRLIEKCR PAIERGEKVQ FMDETRNVNR TVGAMLSGEL
IRHRPEGLPD HTIFMQMEGV GGQSFGAFLA QGITLYLIGD ANDYTGKGLS GGRVVVRPSI
DFRGDATQNI IVGNTVLYGA TSGEAFFRGV AGERFAVRLS GATTVVEGTG DHGCEYMTGG
TVVVLGKTGR NFAAGMSGGI AYVYDEDGSF SQRCNTAMVA MDKVLTADEQ RSTQEAAIFH
KGVADEVLLR KLIEDHHRWT GSLRARDILD HWPAARGKFV KVFPHEYKRA LGEIHAKKEA
SETIAKARTN DKKSSKAKA