Gene Mpe_A1540 details

Gene Information

Locus tag: Mpe_A1540
Symbol:
ID: 4783558
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylibium petroleiphilum PM1
Kingdom: Bacteria
Replicon accession: NC_008825
Strand:
Start bp: 1660406
End bp: 1661635
Gene Length: 1230 bp
Protein Length: 409 aa
Translation table: 11
GC content: 65%
IMG OID: 640090107
Product: aminotransferase AlaT
Protein accession: YP_001020737
Protein GI: 124266733
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0436] Aspartate/tyrosine/aromatic aminotransferase
TIGRFAM ID:
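
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end - start + 1


def expected_protein_length(gene_len: int) -> int:
    """Amino acid count for a CDS whose last codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the terminal stop codon


length = gene_length_bp(1660406, 1661635)
print(length)                           # 1230
print(expected_protein_length(length))  # 409
```

Both results agree with the Gene Length and Protein Length fields listed in the record.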


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.790745
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCCCAAGC CCATCGCCAA GTCCAGCAAG CTCGCCAACG TGTGCTACGA CATCCGTGGT 
CCGGTGCTGG ACAAGGCCCG CGCCATGGAA GAAGAGGGCC AGAAGATCAT CAAGCTCAAC
ATCGGCAATC TGGCGGTGTT CGGCTTCGAT CCGCCCGACG AGATCGTGCA GGACATGATC
CGCAATCTTT CCCAGACGGC GGGCTACACC GACAGCAAGG GTCTGTTCGC ACCGCGCAAG
GCGGTGGTGC ATTACGCGCA GGAGAAGGCG ATCTCCGGCG TGACGGTCGA CGACGTCTAT
CTCGGCAACG GTGCGTCCGA GCTGATCGTG ATGAGCCTCA ACGCGCTGCT CGACAACGGC
GACGAGGTGC TGGTGCCGGC ACCCGATTAC CCGCTGTGGA CTGCGGCCGT GTCGCTGTCG
GGCGGCAATC CGGTGCACTA CCTGTGCGAC GAGGCGAGCG ACTGGTACCC GGACATCGAC
GACATCCGCC GAAAGATCAC GCCGAACACA CGCGCCATCG TCGTCATCAA CCCGAACAAC
CCGACCGGCG CGCTGTACCC GGAGAGTCTG CTGCGCGAGA TCGTCGAACT GGCCCGGCAG
CACCAGCTGA TCATCTTTGC CGATGAGATC TACGACAAGA CGCTGTACGA CGGCAACACC
CATACCAGCA TCGCTTCGCT GGCCGACGAC GTGCTGTGCG TGACGCTGAA CGGCCTCAGC
AAGAACTACC GTGCCTGCGG CTACCGCGCC GGCTGGATGG TGGTGTCGGG CGACAAGCGC
TGCGCGAAGG ACTACATCGA GGGGCTCAAC ATGCTCGCGT CGATGCGCCT GTGCGCCAAC
ACACCGGGGC AGCTGGCCAT CCAGACCGCG CTCGGCGGCT ACCAGAGCAT CAAGGACCTC
GTGGCGCCGG GCGGCCGGCT GACGCGCCAG CGCGACATGG CCTACGAGCT GCTGAGCCAG
ATTCCCGGCG TGAGCGTGGT CAAGCCCAAG GCGGCGCTGT ACATGTTCCC ACGGCTCGAC
CCCAAGCTTT ACCCGATCCA GGACGACCAG CAGTTCGCCT ACGAACTGCT GGCCGAGGAG
AAGGTGCTGA TCGTGCAGGG CACGGGCTTC AACTGGCCGC AGCCCGACCA TTTCCGCGTC
GTGTTCCTGC CGAACTCCGA CGACCTGAGC GACGCGATCG GCCGCATCGC CCGTTTCCTC
GACCATTACC GCAAGCGCCA CAGCGTCTGA
 
Protein sequence
MPKPIAKSSK LANVCYDIRG PVLDKARAME EEGQKIIKLN IGNLAVFGFD PPDEIVQDMI 
RNLSQTAGYT DSKGLFAPRK AVVHYAQEKA ISGVTVDDVY LGNGASELIV MSLNALLDNG
DEVLVPAPDY PLWTAAVSLS GGNPVHYLCD EASDWYPDID DIRRKITPNT RAIVVINPNN
PTGALYPESL LREIVELARQ HQLIIFADEI YDKTLYDGNT HTSIASLADD VLCVTLNGLS
KNYRACGYRA GWMVVSGDKR CAKDYIEGLN MLASMRLCAN TPGQLAIQTA LGGYQSIKDL
VAPGGRLTRQ RDMAYELLSQ IPGVSVVKPK AALYMFPRLD PKLYPIQDDQ QFAYELLAEE
KVLIVQGTGF NWPQPDHFRV VFLPNSDDLS DAIGRIARFL DHYRKRHSV
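
The protein sequence begins with M even though the gene sequence begins with GTG, which normally encodes valine. This follows from translation table 11 (the bacterial, archaeal and plant plastid code), where GTG is one of several alternative start codons read as methionine at the initiation position. The following is a minimal sketch of that translation rule, not the pipeline that produced this record. The function name and the start-codon set are written out here for illustration and should be checked against the NCBI genetic code tables before being relied on.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G);
# table 11 uses the same assignments for internal codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Codons accepted as initiators under table 11 (assumption: taken from the NCBI listing).
START_CODONS_11 = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, reading an alternative start codon as M and stopping at the first stop."""
    dna = "".join(dna.split()).upper()  # drop the 10-base grouping spaces
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS_11:
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGCCCAAGC CCATCGCCAA GTCCAGCAAG"))  # MPKPIAKSSK
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to the same function should yield the full protein, provided the input contains only A, C, G and T.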