Gene Mpe_A1947 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A1947 
Symbol 
ID4786708 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp2083042 
End bp2084244 
Gene Length1203 bp 
Protein Length400 aa 
Translation table11 
GC content69% 
IMG OID640090517 
Productaminotransferase 
Protein accessionYP_001021140 
Protein GI124267136 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0436] Aspartate/tyrosine/aromatic aminotransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.564759 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.128663 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCAGCG CCCACCTCCT GGCCCGCACG CCCGTGCCGC CCAGCCGCCT GCCGCGGGTG 
GGCACGACGA TCTTCACCGT GATGTCCGCG CTGGCCCAGG AGCACGGCGC GGTCAACCTC
GGCCAGGGCT TCCCGGACTT CGAGTGCGAC CCGCGCCTGG TCGATGCCGT GACGCAGGCC
ATGCAGGCAG GCCACAACCA GTACCCGCCG ATGGCCGGCG TGCCGGTGTT GCGCGAAGCA
GTCGCCGCAA AGATCGCCGC GCTCTATGGT CACCGCTACG ACCCCGGCAG CGAGATCACC
ATCACCGCTG GCGCGACCCA GGCCATCCTG ACCGCGATCC TCGCGCTGGT GCATCCGGGC
GACGAGGTGA TCGTCCTTGA GCCCTGCTAC GACAGCTATG CGCCAAACAT CGAACTGGCC
GGCGGCCGAG TACGGCGGGT GCCGCTGACG CCGGGCCGCT TCCGGCCGGA CTTCGACCGT
ATCGCCGCGG CGCTCGGGCC GCGCACCCGC GCAATCCTCG TCAACACGCC GCACAACCCG
AGCGCCACGG TGTGGACCGC CGGCGAGATG CAGCGCCTGG CCGATCTGCT GCGGCCCACC
AACGTGATCG TCATCGCCGA CGAGGTCTAC GAGCACATGG TGTTCGACGG CCAAGCCCAC
CAGAGCGTGG CTCGCCATGC GGAGCTCGCC GCGCGCTCCG TCATCGTGTC GAGCTTCGGA
AAGACCTTTC ATGTGACCGG CTGGAAGGTG GGCTACGCCG CGGCCCCGGC CGAACTGATG
GCGGAGTTCC GCAAGGTGCA TCAATTCAAT GTGTTCACCG TCAACACGCC GGTGCAGCAC
GCGCTGGCCG CCTACCTGGG CGACCCTCGC CCCTACCTGG ACCTGCCGGA TTTTTATGCA
CGCAAGCGCG ACCGCTTCCG CGCCGGGCTC GCGGACACCG GCCTCGACCT GATGCCCAGC
GAAGGCAGCT ACTTCCAGTG CGTGGGTTAT GGCGGCCTGG CCGCGCATCG GGCGCGCAGC
GAAGCCGAGT TCTGCCGCTG GTTGACCACC GAGGCCGGCG TCGCGGCGAT TCCGCTGTCG
GCGTTCTACG ACGCCGGATT CGAACAGCGG GTCGTGCGCT TCTGCTTTGC CAAGCGCGAA
GGCACGCTGG ATGCCGCGTT GCAGCGGCTG CGCACGGCGC TGTCCGCGCG ATCTCCCGGC
TGA
 
Protein sequence
MSSAHLLART PVPPSRLPRV GTTIFTVMSA LAQEHGAVNL GQGFPDFECD PRLVDAVTQA 
MQAGHNQYPP MAGVPVLREA VAAKIAALYG HRYDPGSEIT ITAGATQAIL TAILALVHPG
DEVIVLEPCY DSYAPNIELA GGRVRRVPLT PGRFRPDFDR IAAALGPRTR AILVNTPHNP
SATVWTAGEM QRLADLLRPT NVIVIADEVY EHMVFDGQAH QSVARHAELA ARSVIVSSFG
KTFHVTGWKV GYAAAPAELM AEFRKVHQFN VFTVNTPVQH ALAAYLGDPR PYLDLPDFYA
RKRDRFRAGL ADTGLDLMPS EGSYFQCVGY GGLAAHRARS EAEFCRWLTT EAGVAAIPLS
AFYDAGFEQR VVRFCFAKRE GTLDAALQRL RTALSARSPG