Gene Mext_3032 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_3032 
Symbol 
ID5835501 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp3378046 
End bp3379698 
Gene Length1653 bp 
Protein Length550 aa 
Translation table11 
GC content70% 
IMG OID641368832 
Productthreonine dehydratase, biosynthetic 
Protein accessionYP_001640492 
Protein GI163852449 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1171] Threonine dehydratase 
TIGRFAM ID[TIGR01124] threonine ammonia-lyase, biosynthetic, long form 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value0.08767 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGGGAT CCGGCTCTGA CGGTGCGGCC CCTTCCCTCG CCTCCCGCCC TCGCATCCCG 
GCCGCGGAAG GACTAAGTCC CGGGCGTGCC CTGCCCGATG ACGCGGCCGA AAGCCTGCCC
GTGACCGACT ACATCAAGAA GATCCTCTCT GCCCGCGTCT ACGACGTGGC GATCGAGAGC
CCCCTCGATC CGATGCCCCG CCTGACGAAG CGGCTCGGCC GTCCCGTGCT GCTCAAGCGC
GAGGATCTGC AGCCGGTCTT CTCGTTCAAG CTGCGCGGCG CCTACAACAA GATGGCCTCG
CTGCCCCAAG AGCGGCTCGA GAGCGGCGTG ATCTGCGCCT CGGCCGGCAA CCACGCGCAG
GGCGTGGCGC TAGCGGCGGC CAAGCTCGGC GTGCGGGCGG TGATCGTGAT GCCGCGCACG
ACGCCCGCGA TCAAGGTCGA TGCCTGCCGG GCCCGCGGCG CCGAGGTCGT GCTGCACGGC
GACGCCTTCG ACGAGGCTCT GGCGGAGGCT CGGCGCCTCG AAGCGCAGTG GGGCCTGACC
TTCCTGCACC CGTTCGACGA TCCCGAGGTG ATCGCCGGAC AGGGTACGAT CGGCATGGAG
ATCCTGCATC AGCATACCGG ACCGATCGAG GCGATCTTCG TGCCGATCGG CGGCGGTGGC
TTGGCCGCCG GCATCGCCAC CTTCGTGAAA TATCTGCGCC CCGAGACCAA GGTGATCGGC
GTCGAGCCGG ACGACGCCGC CACCATGTCC GAGGCGCTCC GGGCGGGCGA CCGGGTGATG
CTGCCGAGCG TCGGGCTGTT CGCCGACGGC GTCGCGGTGC GGCAGGCCGG CGAGGAGACG
TTCCGGCTCT GCCGCGAGCA TCTCGACGCG GTCATCACCG TCGATACCGA TGCGATGTGC
GCCGCGGTCA AGGACATCTT CGACGACACC CGCGCGATCT CCGAGCCGTC GGGCGCACTG
AGTCTGGCCG GCGCCAAGGC CTGGTCAGCG AAGAATCCCG GTGCCGGGCC GCTGGTGGCG
ATCTCGTCGG GTGCCAACCT CAACTTCGAC CGCCTGCGCC ACATCGCCGA GCGGGCGGAG
ATCGGCGAGG AACGCGAGGT GCTGCTCGGC GTCACCATCC CCGAACGGCC GGGCGCCTAC
CGTGCCTTCA TCGGCGCGCT CGGGCCCCGC GCGATCACCG AATTCAACTA CCGCTACGCG
CAAGGCAGCG ACGCGCGCAT CTTCGTCGGC ATCAACCTGC CCGGCGGCAA GCCCGAGAAG
CGCGACCTGA TCGCCGCCCT GGAGAGCGCC GGCTACCGCG TCGCCGATAT GAGCGACAAC
GAGATGGCCA AGGTGCATGT CCGCTACATG GTGGGCGGCC GCGCGGCGGG GCTCGCCGAC
GAGCGGCTCT ACCGCTTCCA GTTCCCCGAG CGGCCGGGCG CGCTGATGAA GTTCCTCGAA
GCGCTCGGCG ACGGCTTCAA CATCAGCCTG TTCCACTACC GCAATCACGG CGCCGATTAC
GGCCGTGTGC TCGCGGGGAT CGAGGTGCCG GCAGCGGAGC GCGCCCGCTT CGAGGCCGCC
CTCGAAGCGC TCGCCTATCC CTATGTCGAT GAGACCGACA ACCCGGCTTA CCGGCTGTTC
CTCGACAACG GCATCGGAGC GGCCGACCAC TGA
 
Protein sequence
MPGSGSDGAA PSLASRPRIP AAEGLSPGRA LPDDAAESLP VTDYIKKILS ARVYDVAIES 
PLDPMPRLTK RLGRPVLLKR EDLQPVFSFK LRGAYNKMAS LPQERLESGV ICASAGNHAQ
GVALAAAKLG VRAVIVMPRT TPAIKVDACR ARGAEVVLHG DAFDEALAEA RRLEAQWGLT
FLHPFDDPEV IAGQGTIGME ILHQHTGPIE AIFVPIGGGG LAAGIATFVK YLRPETKVIG
VEPDDAATMS EALRAGDRVM LPSVGLFADG VAVRQAGEET FRLCREHLDA VITVDTDAMC
AAVKDIFDDT RAISEPSGAL SLAGAKAWSA KNPGAGPLVA ISSGANLNFD RLRHIAERAE
IGEEREVLLG VTIPERPGAY RAFIGALGPR AITEFNYRYA QGSDARIFVG INLPGGKPEK
RDLIAALESA GYRVADMSDN EMAKVHVRYM VGGRAAGLAD ERLYRFQFPE RPGALMKFLE
ALGDGFNISL FHYRNHGADY GRVLAGIEVP AAERARFEAA LEALAYPYVD ETDNPAYRLF
LDNGIGAADH