Gene Mpe_A2157 details


Gene Information

Locus tag: Mpe_A2157
Symbol:
ID: 4785821
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylibium petroleiphilum PM1
Kingdom: Bacteria
Replicon accession: NC_008825
Strand:
Start bp: 2312915
End bp: 2314153
Gene Length: 1239 bp
Protein Length: 412 aa
Translation table: 11
GC content: 67%
IMG OID: 640090725
Product: tryptophan synthase subunit beta
Protein accession: YP_001021348
Protein GI: 124267344
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0133] Tryptophan synthase beta chain
TIGRFAM ID: [TIGR00263] tryptophan synthase, beta subunit
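
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the coding sequence ends with a single stop codon that is not counted in the protein length. Both assumptions are consistent with the values in this record, but they are not stated by the record itself. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


# Values copied from the record above.
START_BP = 2312915
END_BP = 2314153

length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))
```

With the record's coordinates this gives 1239 bp and 412 aa, matching the Gene Length and Protein Length fields.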


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.595682
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.092632
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTGAATT ACCAGCAACC CGATGCGAGC GGCCATTTCG GCCGCTATGG CGGCAGCTTC 
GTCGCCGAGA CCCTGATCCA CGCGCTCGAC GAACTGAAGG CCGCCTACGC GCGCTATCGC
GACGATCCCG AGTTCGTGGC CGAGTTCAAG AGCGAGCTCG CGCATTTCGT CGGCCGACCC
AGTCCGATCT ACCACGCCGC GCGCATGAGC CGCGAGCTCG GTGGCGCCCA GATCTACCTG
AAGCGCGAGG ACCTCAACCA CACCGGCGCC CACAAGATCA ACAACACCAT CGGCCAGGCG
CTGCTGGCCC GGCGCATGGG CAAGCCGCGC GTGATCGCCG AAACCGGTGC CGGCCAGCAC
GGCGTGGCCA CCGCCACCAT CTGTGCCCGC TACGGCATGG AATGCGTGGT CTACATGGGC
AGCGAGGACG TGAAGCGCCA GTCGCCCAAC GTCTACCGCA TGCACCTGCT GGGCGCCAGG
GTGGTGCCGG TGGACAGCGG CAGCAAGACG CTGAAGGACG CGCTGAACGA GGCGCTGCGC
GACTGGGTCA CCAACGTCGA GAACACCTTC TACATCATCG GCACCGTGGC CGGCCCGGCG
CCGTACCCGG AGATGGTGCG TGACTTCCAG AGCGTCATCG GCGAGGAATG CCTGCGGCAG
ATGCCGGAGA TGGCGGGTCG CCAGCCCGAC GCGGTGATCG CCTGCGTCGG CGGCGGCAGC
AATGCGATGG GCATCTTCTA CCCGTATATC CGACACGAGG GCGTGCGCCT GATCGGCGTG
GAGGCGGCCG GACACGGGCT CGACTCCGGC AAGCATGCGG CCAGCCTCAG CGCCGGCTCG
CCGGGCGTGC TGCACGGCAA CCGCACCTAC CTGTTGCAGG ACGCGAACGG CCAGATCATC
GAGACGCACT CGATCTCCGC CGGACTCGAT TACCCCGGCG TCGGCCCTGA GCACGCCTAC
CTGAAGGACA TCGGGCGGGC CGAGTACGTC GGCATCACCG ACGACGAGGC GCTGCAGGCC
TTTCACCGGC TGTGCCGCAC CGAAGGCATC ATCCCGGCGC TTGAATCCAG CCATGCGGTG
GCCTACGCGA TGAAACTGGC GCCGACGATG CGCAGCGACC AGAGTCTGCT GGTCAATCTG
TCCGGCCGGG GCGACAAGGA CATCGGCACC GTTGCCGACC TGTCCGGCGC CGAGTTCTAC
GACCGGCCGT CGTCGCGCGG CGAGAAGGTG AAGCAATGA
 
Protein sequence
MLNYQQPDAS GHFGRYGGSF VAETLIHALD ELKAAYARYR DDPEFVAEFK SELAHFVGRP 
SPIYHAARMS RELGGAQIYL KREDLNHTGA HKINNTIGQA LLARRMGKPR VIAETGAGQH
GVATATICAR YGMECVVYMG SEDVKRQSPN VYRMHLLGAR VVPVDSGSKT LKDALNEALR
DWVTNVENTF YIIGTVAGPA PYPEMVRDFQ SVIGEECLRQ MPEMAGRQPD AVIACVGGGS
NAMGIFYPYI RHEGVRLIGV EAAGHGLDSG KHAASLSAGS PGVLHGNRTY LLQDANGQII
ETHSISAGLD YPGVGPEHAY LKDIGRAEYV GITDDEALQA FHRLCRTEGI IPALESSHAV
AYAMKLAPTM RSDQSLLVNL SGRGDKDIGT VADLSGAEFY DRPSSRGEKV KQ
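
The relationship between the gene sequence, the protein sequence and the GC content field can be sketched as follows. This is a minimal illustration, not the method used to produce the record: it assumes the sequence blocks are pasted in as text with spaces and line breaks, and it uses the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two tables differ in permitted start codons, which does not matter here because this gene starts with ATG). The function names are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering. Amino-acid assignments are
# those of the standard code, which translation table 11 shares.
_BASES = "TCAG"
_AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(_BASES, repeat=3), _AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks."""
    return "".join(text.split()).upper()


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in a cleaned DNA sequence."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence listed above.
first_blocks = clean_sequence("ATGTTGAATT ACCAGCAACC CGATGCGAGC")
print(translate(first_blocks))
```

Applied to the first three blocks of the gene sequence, this prints MLNYQQPDAS, the first block of the protein sequence. Running `gc_content` on the full cleaned gene sequence gives a value that can be compared with the GC content field.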