Gene Mext_1157 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_1157 
Symbol 
ID5833939 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp1267660 
End bp1268769 
Gene Length1110 bp 
Protein Length369 aa 
Translation table11 
GC content71% 
IMG OID641366950 
ProductNitrilase 
Protein accessionYP_001638630 
Protein GI163850587 
COG category[R] General function prediction only 
COG ID[COG0388] Predicted amidohydrolase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.932259 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value0.338424 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCGTCC AATACCCGAA GTTCAAGGCG GCCGCCTGCC ACGTCGCCTC GGTCTTCCTC 
GACAGCACCG CCTCCGCTGA GAAGGCCGTT GCGCTGATCG GCGAGGCCGC CCGCGCGGGC
GCCGACCTCG TGGTCTTTCC CGAGGGCTAC ATGCCGGGCT TCCCGCTCTG GGCGGCTCTG
CGGGCGCCGA TCCACAACCA CGATCTGTTC AAGCGCCTCG CGGCCCAGTC CGTGCGCCTG
GACGGCCCGG AGATCGGCGC GGTCCGCGCC GCCGCCCGGC GCCACGGCGT GCTCGTCTCG
CTCGGGTTCA GCGAGAGCAC CGAGGCCAGC GTCGGCTGCC TGTGGAACGC GAACGTGCTG
ATCGGGCGCG ACGGGGCGAT CCTCAACCAC CACCGCAAGC TCGTGCCGAC CTTCTACGAA
AAGCTCATCT GGGCGAACGG CGACGCCCGG GGCTTGCGCG TGACGCGCAC CGAGATCGGC
CGCGTCGGCA TGCTGATCTG CGGTGAGAAC ACCAATCCGC TGGCCCGGTA CACGCTGATG
GCCCAGGGCG AGCAGGTCCA CATCTCGACC TACCCGCCGG CTTGGCCGAC GCGCCCGCCG
GGAGAGAGCG CCGCCTACGA CCTGAAACGG GCCATCGAGA TCCGTGCCGG GGCGCATGCT
TTCGAGGCCA AGGTGTTCAA CATCGTCTGC TCCGCCGTCC TCGACGCGGC CGCCAGGGCC
ACCCTCTGCG ACGGGGACGC TGCCCTCGCC GAACTCGTCG AGCGGACCCC CGCGGGCGTG
TCCATGGTCC TCGACCCCAC CGGCTCCCAT GTCGTCGAGC CGCACCAGGG GGACGAGACG
ATCGTCTACG CCGACATCGA CGTCGAGGCC TGCGTCGAGC CCAAGCAGTT CCACGACGTC
GTCGGCTACT ACAACCGCTT CGACATCTTC CGCCTCCATG TCGACCGCAC GCCGCGCGAG
CCGATCAGCT TCGACGCGGC CGCCCGGCCG TCGGGCGTTG CCGCCGACGG CGTCGATGGG
CTCGAGGCCC TCGATCCGGA TGGCGCCCGC GCACAGCCCG GCCTTGGCGA GGCGGCGCCG
GCCCCGCCGC TTCGCCGCGC AGGCCACTGA
 
Protein sequence
MSVQYPKFKA AACHVASVFL DSTASAEKAV ALIGEAARAG ADLVVFPEGY MPGFPLWAAL 
RAPIHNHDLF KRLAAQSVRL DGPEIGAVRA AARRHGVLVS LGFSESTEAS VGCLWNANVL
IGRDGAILNH HRKLVPTFYE KLIWANGDAR GLRVTRTEIG RVGMLICGEN TNPLARYTLM
AQGEQVHIST YPPAWPTRPP GESAAYDLKR AIEIRAGAHA FEAKVFNIVC SAVLDAAARA
TLCDGDAALA ELVERTPAGV SMVLDPTGSH VVEPHQGDET IVYADIDVEA CVEPKQFHDV
VGYYNRFDIF RLHVDRTPRE PISFDAAARP SGVAADGVDG LEALDPDGAR AQPGLGEAAP
APPLRRAGH