Gene Mpe_A1542 details

Gene Information

Locus tag: Mpe_A1542
Symbol:
ID: 4783560
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylibium petroleiphilum PM1
Kingdom: Bacteria
Replicon accession: NC_008825
Strand:
Start bp: 1663021
End bp: 1664433
Gene Length: 1413 bp
Protein Length: 470 aa
Translation table: 11
GC content: 67%
IMG OID: 640090109
Product: threonine synthase
Protein accession: YP_001020739
Protein GI: 124266735
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0498] Threonine synthase
TIGRFAM ID: [TIGR00260] threonine synthase
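
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers introduced here for illustration.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1663021
END_BP = 1664433


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length = gene_length(START_BP, END_BP)
print(length)                  # 1413
print(protein_length(length))  # 470
```

Under these assumptions both results agree with the Gene Length (1413 bp) and Protein Length (470 aa) fields.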


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACTACC TCAGCACCCG CGGCGACTCC ACGCCGCGCG GCTTCTCCGA CATCCTGCTC 
GAAGGCCTGG CGCCGGATGG GGGTCTCTAC CTGCCGGAGC GCTATCCGCG CGTCGATGCG
GCCACGCTGG CGCGCTGGCG CGGCCTGTCC TACGCCGACC TCGCGTTCGA GATCCTGTCG
CTCTACATCA CCGACATTCC GGCCGACGAC CTGCGCACGC TGATCCGCAA GACCTACACG
CGCGAGGTCT TCGGTACGAC CGAGATCACC CCGCTGAAGC CGCTGGAGCC TGGCGTCGCG
CTGCAGGCCC TGTCGAACGG ACCGACGCTC GCGTTCAAGG ACATGGCCAT GCAGCTGCTG
GGCCAGCTGT TCGAGTACGA GCTGGCGCGC CGCGGCGAAA CGCTCAACAT CCTGGGCGCG
ACCTCCGGCG ACACCGGCAG CGCGGCCGAG TACGCGATGC GCGGCAAGCA GGGCGTGCGC
GTCTTCATGC TCAGTCCGCA CGGCCGCATG AGCCCGTTCC AGCAAGCGCA GATGTTCAGC
TTGCCCGACG CGAACATTCA CAACCTCGCG GTCGAGGGCG TGTTCGACGA CTGCCAGGAC
ATCGTCAAGG CCGTGTCCAA CGACCTGGAG TTCAAGCGCC GCTGGCGCAT CGGCACCGTC
AACTCCATCA ACTGGGCAAG GCTGCTGGCG CAGGTGGTGT ATTACTTCGC CGGTTATTTC
CAGGCCACGA AGTCGAACGA CGAGCGGGTG AGCTTCTCGG TGCCATCGGG CAACTTCGGC
AACGTGTGCG CCGGCCATGT GGCGCGCATG ATGGGCCTGC CGATCGAGCA GCTGGTGGTC
GCGACAAACG AGAACGACGT GCTCGACGAG TTCTTCCGCA CCGGCAGCTA CCGCGTGCGC
GGCGCGGCCG ACACGCATGA GACTTCGAGT CCCTCGATGG ACATCTCCAA GGCCAGCAAC
TTCGAGCGCT TCGTGTTCGA CCTGCTGGGT CGCGACGCGG CACGCACGCG CCAGCTGTTC
GGTGACGACA TCGCACGGCA CGGCGGCTTC ACGCTGACGC CCGCCGAGTT TGCGCGGGTG
CGCGAGTTCG GCTTCGTGTC CGGCAAGAGC ACGCACGCCG ACCGCGTGGC GACCATCCGC
GACACTCAGC AGCGTTTCGG CGTGACCATC GACCCGCACA CCGCCGACGG CCTGAAGGTG
GGCCGTGCGT ACGTGAAGCC CGGCACGCCG CTGCTGGTGC TGGAGACCGC GCTGCCGATC
AAGTTCGCCG CCACCATCCT CGAGGCGCTC GGCCACGAGC CGCCCCGGCC GGCCGGGCTC
GAGGGGCTCG AGCAGCTCCC CAGGCGCTTC AAGGTCATGC CGGCCAGCTC GGCGACGGTG
CAGGCCTACA TCGTCGAGAA TTGTTCGACA TGA
 
Protein sequence
MNYLSTRGDS TPRGFSDILL EGLAPDGGLY LPERYPRVDA ATLARWRGLS YADLAFEILS 
LYITDIPADD LRTLIRKTYT REVFGTTEIT PLKPLEPGVA LQALSNGPTL AFKDMAMQLL
GQLFEYELAR RGETLNILGA TSGDTGSAAE YAMRGKQGVR VFMLSPHGRM SPFQQAQMFS
LPDANIHNLA VEGVFDDCQD IVKAVSNDLE FKRRWRIGTV NSINWARLLA QVVYYFAGYF
QATKSNDERV SFSVPSGNFG NVCAGHVARM MGLPIEQLVV ATNENDVLDE FFRTGSYRVR
GAADTHETSS PSMDISKASN FERFVFDLLG RDAARTRQLF GDDIARHGGF TLTPAEFARV
REFGFVSGKS THADRVATIR DTQQRFGVTI DPHTADGLKV GRAYVKPGTP LLVLETALPI
KFAATILEAL GHEPPRPAGL EGLEQLPRRF KVMPASSATV QAYIVENCST
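
The sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that layout, compute a GC fraction, and translate a coding sequence. It is an illustration only, not the database's own method. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. The codon table is the standard amino acid assignment, which translation table 11 shares; table 11's alternative start codons are not handled, which does not matter here because this gene starts with ATG.

```python
BASES = "TCAG"
# Amino acids for codons ordered TTT, TTC, TTA, TTG, TCT, ... (standard assignment).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def clean(seq: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First and last lines of the gene sequence from this record.
first_line = "ATGAACTACC TCAGCACCCG CGGCGACTCC ACGCCGCGCG GCTTCTCCGA CATCCTGCTC"
last_line = "CAGGCCTACA TCGTCGAGAA TTGTTCGACA TGA"
print(translate(first_line))  # MNYLSTRGDSTPRGFSDILL
print(translate(last_line))   # QAYIVENCST
```

The two translated fragments match the first 20 and last 10 residues of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` would give the value to compare against the GC content field.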