Gene Mpe_A0312 details


Gene Information

Locus tag: Mpe_A0312
Symbol:
ID: 4786862
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylibium petroleiphilum PM1
Kingdom: Bacteria
Replicon accession: NC_008825
Strand:
Start bp: 337608
End bp: 338519
Gene Length: 912 bp
Protein Length: 303 aa
Translation table: 11
GC content: 70%
IMG OID: 640088864
Product: branched chain amino acid: 2-keto-4-methylthiobutyrate aminotransferase
Protein accession: YP_001019509
Protein GI: 124265505
COG category: [E] Amino acid transport and metabolism; [H] Coenzyme transport and metabolism
COG ID: [COG0115] Branched-chain amino acid aminotransferase/4-amino-4-deoxychorismate lyase
TIGRFAM ID: [TIGR01121] D-amino acid aminotransferase
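
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it agrees with the reported 912 bp) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 337608
end_bp = 338519

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of three bases; the last codon is the stop codon,
# which does not contribute an amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")
print(protein_length, "aa")
```

Under these assumptions the computed values match the Gene Length and Protein Length fields of the record.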


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.381931
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00369905
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAACCCGC TGCCCAGCGC CATCCCCGCG GCGAGCGCCG ACTCGCTGTG CTACCTGAAC 
GGCGACTACA CCCGCCTGGC GGACGCTCGC GTCAGCGTGC TCGACCGCGG CTTCATGTTC
GGCGACGGCG TCTACGAGGT CCTGCCCGTC TACGATCGTC GGCTGTTCCG CTTCGACGAG
CACATGGCGC GCCTGGAGCG CAGCCTCGCC AAGGTGCGCA TCACCGCGCC GCTGACCCGC
GAGGACTGGC TGGCGCGCAT GCGCCGGCTG GTCGCGGCCC AGCACGAGCA CAGCGGGGCG
ACCGACCAGC TCGTGTACCT GCAGGTCACG CGCGGCGTGG CGCTGCGCGA GCACACGATG
CCGACCGACA TCGAGCCCAC GGTCTTCATG ATGTGCAGTC CGGCGAAGCC GCCGACGCCC
GAGCAGCGCC ATGCCGGCGT GGCCTGCATC AGCGCGCGCG ACTTCCGCTG GGAGCGCGGC
GACATCAAGA GCATTTCGCT GCTCGGCAAC GTGCTGGCGC GGCAGATGTC GGCCGACAAG
GGCGCCGTCG AGACCCTCCT GTTTCGCGAC GGCTTCCTGA CCGAGGCAGC GGCGTCCAAC
GTGTGGATGG TGAAGGAAGG CGCACTGATC GGCCCGCCGA AGAGCGAACT GCTGCTCGAA
GGCGTGCGGG TCGACCTGCT GGCCGAGCTG TGCGAGGAGT GCGGCATCGG CTACAGCCTG
CGGCCGGTCA GCGAGGGCGA GGTCTTCTCG GCCGACGAAC TGCTGCTGAG TTCGGCGATG
AAGGAAGTGC TGGCGGTCAC CCGTCTCGAT GGCGAACTGG TCGGGCACGG CGCGTTGCGC
GGCAAGCCCG GGCCGGTGTA CGCCCGGCTC TACGAGGCCT ACCAGCGGGC CAAGCCCGCC
CAGTCGATCT GA
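
A GC content figure such as the one reported for this gene is the fraction of G and C bases in the sequence. The following is a minimal sketch of that calculation, not the method used by the source database; the function name `gc_content` is hypothetical. It strips the spaces and line breaks used in the ten-base block layout above before counting.

```python
def gc_content(sequence: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace and case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# The first two ten-base blocks of the gene sequence, as a small example:
# 14 of these 20 bases are G or C.
first_two_blocks = "ATGAACCCGC TGCCCAGCGC"
print(f"{gc_content(first_two_blocks):.0%}")
```

Passing the full gene sequence, pasted with its spaces and line breaks, gives the value for the whole gene.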
 
Protein sequence
MNPLPSAIPA ASADSLCYLN GDYTRLADAR VSVLDRGFMF GDGVYEVLPV YDRRLFRFDE 
HMARLERSLA KVRITAPLTR EDWLARMRRL VAAQHEHSGA TDQLVYLQVT RGVALREHTM
PTDIEPTVFM MCSPAKPPTP EQRHAGVACI SARDFRWERG DIKSISLLGN VLARQMSADK
GAVETLLFRD GFLTEAAASN VWMVKEGALI GPPKSELLLE GVRVDLLAEL CEECGIGYSL
RPVSEGEVFS ADELLLSSAM KEVLAVTRLD GELVGHGALR GKPGPVYARL YEAYQRAKPA
QSI
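
The protein sequence above can be reproduced from the gene sequence by translating it codon by codon. The sketch below is an illustration in plain Python rather than the database's own pipeline; `CODON_TABLE` and `translate` are hypothetical names. It relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard code, differing only in which codons may act as start codons. Alternative start codons are not handled specially here, which is sufficient for this gene because it begins with ATG.

```python
# Codon table in the conventional NCBI ordering (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    # Step through complete codons only; trailing bases are ignored.
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence above.
print(translate("ATGAACCCGC TGCCCAGCGC CATCCCCGCG"))
print(translate("CAGTCGATCT GA"))
```

The two fragments translate to the first ten residues and the last three residues of the protein sequence shown above; the final TGA codon is the stop.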