Gene Mpe_A3770 details


Gene Information

Locus tag: Mpe_A3770
Symbol:
ID: 4785999
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylibium petroleiphilum PM1
Kingdom: Bacteria
Replicon accession: NC_008825
Strand:
Start bp: 3988094
End bp: 3989332
Gene Length: 1239 bp
Protein Length: 412 aa
Translation table: 11
GC content: 71%
IMG OID: 640092353
Product: threonine dehydratase
Protein accession: YP_001022958
Protein GI: 124268954
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1171] Threonine dehydratase
TIGRFAM ID: [TIGR01127] threonine dehydratase, medium form
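
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the gene length includes one terminal stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 3988094
END_BP = 3989332
GENE_LENGTH_BP = 1239
PROTEIN_LENGTH_AA = 412


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons minus one, assuming the CDS ends with a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))            # compare with Gene Length
    print(expected_protein_length(GENE_LENGTH_BP))  # compare with Protein Length
```

Under these assumptions, the span from Start bp to End bp agrees with the listed gene length, and the codon count minus the stop codon agrees with the listed protein length.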


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0248886
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGACCTCAG CCGAGTCGCC GGCGACGGTG ACCCTGGCGG ACGTCGAGGC CGCTGCACGC 
CGGCTGGCCG GACAGGTGCT CGACACACCG TGTGTCGAGT CGAAGACGCT GTCCGAGATC
ACCGGCGCGC AGGTCTTCCT GAAGTTCGAG AACCTGCAGT TCACCGCTTC CTTCAAGGAG
CGCGGCGCGT TGAACAAGCT GCTGGCGCTG GTCGCCGCGC GCGACGACGG CACCGCCGCG
TTGAAGGGCG TGATCGCCGC CTCCGCCGGC AACCATGCGC AGGGCGTGGC GCATCACGCG
CAGCGGCTGG GCCTGCGCGC GGTCATCGTG ATGCCCCGGC ACACGCCGAC GGTGAAGGTC
GAGCGCACGC GCGGCTTCGG TGCCGAGGTG CTGCTGCACG GCGAGAGCTT CGACGAGGCT
CGCGCGCACG CGCTGGAGGT GGCCGCGGCG CAGGGCCTGA GCTTCGTGCA CCCCTTCGAC
GATCCGCTGG TGATCGCCGG CCAGGGCACG ATCGGCCTGG AGATGCTGCG CGCGCAACCG
ACGCTCGACA CTCTGGTCAT CGCCGTCGGC GGCGGCGGGC TGATCTCCGG CATCGCGACC
GTCGCCCGGG CGCTGAAGCC CGGGCTGGAG ATCGTCGGGG TGCAGACGGC GCGCTTTCCG
GCGATGGTGA ACCTCGTCAA GGGCCGCACC CATCCGCAAG GCACCAGCAC GATCGCCGAG
GGCATTGCCG TCGGTGAGCC GGGGCGGATC ACGCGGGCGA TCGTTCGCGA CCTCGTCGAC
GACATGGTGC TGGTGGACGA AGCGGACATC GAGCATGCCC TGGTGATGCT GCTGGAGATC
GAGAAGACGC TGGTCGAGGG GGCCGGCGCC GCGGGTCTGG CTGCGCTGCT GAAGGCGCCC
GGGCGCTACG CCGGGCGACG CGTGGGCCTG GTGCTGTGCG GCGGCAACAT CGATCCGCTG
CTGCTGTCGG CGATCATCGA GCGCGGCATG GTGCGGGCGG GGCGGCTGGC ACGCATCCGC
GTCGACGCGC GCGACGCCCC CGGCGCGCTG GCGCGCATCA CCGCCACCGT GGCCGAGGCC
GGCGCGAACA TCGAGGAAGT GCATCACCAG CGGGCCTTCA CCACGCTGTC GGCGCAGAAC
GCCGAGGTGG AGTTGGTGTT GCAGACGCGC AATCACGCCC ATATCGGCGC AGTGATCGCG
TCCCTGGTGG CTCAGGGGTT CGAGGCCAAG GCCTACTGA
 
Protein sequence
MTSAESPATV TLADVEAAAR RLAGQVLDTP CVESKTLSEI TGAQVFLKFE NLQFTASFKE 
RGALNKLLAL VAARDDGTAA LKGVIAASAG NHAQGVAHHA QRLGLRAVIV MPRHTPTVKV
ERTRGFGAEV LLHGESFDEA RAHALEVAAA QGLSFVHPFD DPLVIAGQGT IGLEMLRAQP
TLDTLVIAVG GGGLISGIAT VARALKPGLE IVGVQTARFP AMVNLVKGRT HPQGTSTIAE
GIAVGEPGRI TRAIVRDLVD DMVLVDEADI EHALVMLLEI EKTLVEGAGA AGLAALLKAP
GRYAGRRVGL VLCGGNIDPL LLSAIIERGM VRAGRLARIR VDARDAPGAL ARITATVAEA
GANIEEVHHQ RAFTTLSAQN AEVELVLQTR NHAHIGAVIA SLVAQGFEAK AY
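
The gene sequence begins with TTG while the protein begins with M. This is consistent with translation table 11, where TTG is one of the alternative start codons and is read as methionine at the initiation position. The sketch below is a minimal, dependency-free illustration of that translation and of a GC-fraction calculation; the function names are hypothetical, the codon table is built from the standard compact amino-acid string, and nothing here is taken from the database's own tooling.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (same assignments in tables 1 and 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Start codons listed for translation table 11.
START_CODONS_TABLE_11 = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate_cds(seq: str) -> str:
    """Translate a complete CDS; a table 11 start codon in first position becomes M."""
    dna = clean(seq)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    codons = [dna[i:i + 3] for i in range(0, len(dna), 3)]
    protein = [CODON_TABLE[codon] for codon in codons]
    if codons and codons[0] in START_CODONS_TABLE_11:
        protein[0] = "M"
    if protein and protein[-1] == "*":
        protein.pop()  # drop the terminal stop codon
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


if __name__ == "__main__":
    # First three ten-base blocks of the gene sequence above.
    print(translate_cds("TTGACCTCAG CCGAGTCGCC GGCGACGGTG"))
```

Passing the full gene sequence to `translate_cds` and `gc_fraction` allows a comparison with the protein sequence and the GC content listed in this record.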