Gene Mpe_A3620 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A3620 
Symbol 
ID4786146 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp3828412 
End bp3830076 
Gene Length1665 bp 
Protein Length554 aa 
Translation table11 
GC content70% 
IMG OID640092202 
Productputative glutamate synthase 
Protein accessionYP_001022808 
Protein GI124268804 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0069] Glutamate synthase domain 2 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.084141 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.179969 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCGCCT GGCTGTTGCG TCTGAACGAA CACTTCCCGA TCCGCTACTT CGTCTGGCTG 
GCCTGTGCGG TCGGCGCGTT GCTGAGCGCG CTGGTGTGGG TGGTGACCGG CAGCGGCGGC
CTGGTGCTGC TGGTGTTCTT GGCGCTCGTG GGCACCGGCG TGCACGACGT GCGGCAGTCG
CGCCACTCGG TGCTGCGCAA CTACCCGGTG ATCGGCCACC TGCGCTTCTT CTTCGAGTTC
ATCCGGCCGG AGATGCGGCA GTACTTCATC GAGGGCGACA ACGAGGCGGC GCCGTTCTCG
CGCCAGCAGC GCTCGCTGGT CTACCAGCGC GCCAAGGGCG AGCCCGACAA GCGGCCGTTC
GGCACGCAGC ACGACGTGGG CGCCGAGGGC TACGAGTGGA TCAACCACTC GATCGCGCCC
ACCACGCTCG CGAGCCATGA CTTCCGCATC ACGGTGGGCG GCGAGCGGGC GCAGCCGTAC
AGCGCGTCGA TCTTCAACAT CTCGGCGATG AGCTTCGGTG CGCTGAGCGC CAACGCCATC
CTCGCGCTCA ACGCCGGCGC CAAGCGCGGC GGCTTCGCGC ACGACACCGG CGAGGGCTCG
ATCAGCCGCT ACCACCGCGA GCACGGCGGC GACCTGATCT GGGAGATCGG CTCGGGCTAC
TTCGGCTGCC GCCACGACGA CGGCTCGTTC AGCGAGGAGC GCTTCGCCGA GACGGCGCGC
GACCCGCAGG TGAAGATGAT CGAGCTCAAG CTGAGCCAGG GTGCCAAGCC CGGCCACGGC
GGCGTGCTGC CGGGCCCGAA GGTGACGCCG GAGATCGCTG CGGCGCGCGG CGTGGCGGTG
GGCGTGGATT GCGTGTCGCC GTCGCGCCAT GCCGCCTTCG ACTCGCCGGT GGGCATGCTG
CAGTTCATCG AGAAGCTGCG CACGCTGTCG GGCGGCAAGC CGGTGGGCTT CAAGCTGTGC
ATCGGCCACC CGTGGGAGTG GTTCGCGATC GCCAAGGCGA TGCGAGAGAC GAATCTGCTG
CCGGACTTCA TCGTGGTCGA CGGCGCCGAG GGCGGCACCG GCGCGGCGCC GCTGGAGTTC
ACCGACCACG TGGGCGCGCC GCTGCAGGAA GGGTTGATGC TGGTGCACAA CACGCTGACC
GGCATCGGGC TGCGCGATCG CATCCAGCTC GGCTGCGCCG GCAAGGTGGT GAGCGCCTTC
GACATCGCGC GGCTGATGGC GCTGGGCGCC GACTGGTGCA ACGCGGGGCG CGGCTTCATG
TTCGCGCTGG GCTGCATCCA GGCGCAGGCC TGCCACACCG GCCACTGCCC GACCGGCGTG
ACCACGCAGG ACCCGATGCG CCAGAAGGCG CTGGTGGTGC CGACCAAGGC CGACCGCGTG
TTCATGTTCC ACCAGGAGAC GCTGCGCGCG CTGAAGGAGC TGGTGCAGGC CGCCGGGCTG
CAGCACCCGC GCGAGATCAC CGCCGCGCAC ATCGTGCGGC GCGTGGCCGA CCACGAGGTG
AAGCTGCTGG CGACGCTGCT GCCGTTCGTG AAGCCGGGCG CGCTGCTGGC CGCCGAGCGC
GGCGAGATCG ACTGGCCGCA CCAGGTGTTC CGGCTCTACT GGCCGCGCGC GAGTGCGCAT
TCCTTCTTGC ACCAGCCCGA TCCGCTGGCG ACAGCGGCCG CCTGA
 
Protein sequence
MPAWLLRLNE HFPIRYFVWL ACAVGALLSA LVWVVTGSGG LVLLVFLALV GTGVHDVRQS 
RHSVLRNYPV IGHLRFFFEF IRPEMRQYFI EGDNEAAPFS RQQRSLVYQR AKGEPDKRPF
GTQHDVGAEG YEWINHSIAP TTLASHDFRI TVGGERAQPY SASIFNISAM SFGALSANAI
LALNAGAKRG GFAHDTGEGS ISRYHREHGG DLIWEIGSGY FGCRHDDGSF SEERFAETAR
DPQVKMIELK LSQGAKPGHG GVLPGPKVTP EIAAARGVAV GVDCVSPSRH AAFDSPVGML
QFIEKLRTLS GGKPVGFKLC IGHPWEWFAI AKAMRETNLL PDFIVVDGAE GGTGAAPLEF
TDHVGAPLQE GLMLVHNTLT GIGLRDRIQL GCAGKVVSAF DIARLMALGA DWCNAGRGFM
FALGCIQAQA CHTGHCPTGV TTQDPMRQKA LVVPTKADRV FMFHQETLRA LKELVQAAGL
QHPREITAAH IVRRVADHEV KLLATLLPFV KPGALLAAER GEIDWPHQVF RLYWPRASAH
SFLHQPDPLA TAAA