Gene Mpe_A2950 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A2950 
Symbol 
ID4784372 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp3133218 
End bp3134399 
Gene Length1182 bp 
Protein Length393 aa 
Translation table11 
GC content72% 
IMG OID640091521 
Productaminotransferase 
Protein accessionYP_001022138 
Protein GI124268134 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0436] Aspartate/tyrosine/aromatic aminotransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00573784 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.780018 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGCTCG CCCGGCGCCT CGACCGCATC GAGCCGTTCT ACGTGATGGA GTGCGCCAAG 
GCAGCGGCGC GCATCGCTGC CAGCCCGGCC TGCGATCCGG CACGCGGCGG CGAGCCGATG
ATCTACCTGA ACATCGGTGA ACCCGACTTC ACCGCGCCGG CGCCGGTGCT CGAGGCGGCG
CAGCGCTGCC TCGCCGAGGG CCGCACGCAG TACACCCCGG CCACCGGGCT GCCTGCGCTG
CGCGAGGCGC TGTCGCGCTG GTATGCGCAG CGCTTCGCGC TCGACATCGA CCCGCAGCGC
ATCGTCATCA CGGCCGGCGC GTCGGCCGCG CTGCAGCTCG CCTGCCTGGC GCTTTTCGAG
TCCGGCGACG AGGTGCTGAT GCCGGATCCG AGCTACCCCT GCAATCGCCA TTTCGTCGCG
GCGGCCGACG CCACGCCGGT GCTGCTGCCC TGTGGCCCGG TGCAGCGCTA TCAGCTCGAC
GCCGCCGGGG TGGAGCGGGC GTGGAACGCG CGCACCCGCG GCGTGCTGCT GGCGTCGCCG
TCCAACCCCA CCGGCACTTC GATCGCCGCC GACGAGATGC AGCGCATCGC ACAGGCGGTG
CGCGCACGCG GCGGCGTCAC GCTGGTCGAC GAGATCTACC TGGGCCTGAG CTACGACGCC
GCCTACGGCC GCTCCGCGCT CGCCCACGGC GACGACGTGG TGTCGATCAA CAGCTTCTCC
AAATACTTCA GCATGACCGG CTGGCGGCTG GGCTGGCTGG TGCTGCCGCC CGCGCTGGTG
GCGCCGGTCG AGAAGCTGGC GCAGAACCTG TTCATCTGCC CGTCCAGCGT CGCGCAGCAC
GCGGCGCTGG CCTGCTTCGA GCCCGCATCG ATCGCCGAGT ACGAACGCCG CCGTGCCGCC
TTCCGGGCCC GGCGCGACTA CATCGTGCCC GCGCTCGCGT CGCTCGGCCT GCCGGTGCCG
GTGCTGCCCG ACGGCGCCTT CTATGCCTGG GCCGACTGCT CGACCCACGC CGGCAGCAGC
TGGGACCTGG CGTTCGCGCT CATGGATTCG GCCCATGTCG CGCTGACGCC GGGGCGCGAT
TTCGGCCGGC ACGGCACTGA ACACCACCTG CGACTGTCCT TCGCCAGCAG CCTGCCGCAG
CTCGAACAGG CCGTCGCGCG CCTGCGGCGC GTGCTGGCAT GA
 
Protein sequence
MKLARRLDRI EPFYVMECAK AAARIAASPA CDPARGGEPM IYLNIGEPDF TAPAPVLEAA 
QRCLAEGRTQ YTPATGLPAL REALSRWYAQ RFALDIDPQR IVITAGASAA LQLACLALFE
SGDEVLMPDP SYPCNRHFVA AADATPVLLP CGPVQRYQLD AAGVERAWNA RTRGVLLASP
SNPTGTSIAA DEMQRIAQAV RARGGVTLVD EIYLGLSYDA AYGRSALAHG DDVVSINSFS
KYFSMTGWRL GWLVLPPALV APVEKLAQNL FICPSSVAQH AALACFEPAS IAEYERRRAA
FRARRDYIVP ALASLGLPVP VLPDGAFYAW ADCSTHAGSS WDLAFALMDS AHVALTPGRD
FGRHGTEHHL RLSFASSLPQ LEQAVARLRR VLA