Gene Mext_0294 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_0294 
Symbol 
ID5832454 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp331146 
End bp332207 
Gene Length1062 bp 
Protein Length353 aa 
Translation table11 
GC content67% 
IMG OID641366079 
Producttryptophanyl-tRNA synthetase 
Protein accessionYP_001637789 
Protein GI163849746 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0180] Tryptophanyl-tRNA synthetase 
TIGRFAM ID[TIGR00233] tryptophanyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value0.496301 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCGCGT TCACCGAACT CGTGTTCTCC GGCGTCCAGC CGACCGGGAA CCTGCACCTC 
GGCAATTATC TCGGCGCCAT CAAGCGCTTC GTCGAGATGC AGGCGCGCGA CGCGCAGTGC
CTCTATTGCG TGGTCGATCT CCACGCCATC ACGATGTGGC AGGACCCGGA GGCGCTCAAG
GGCCAGATCC GCGAAGTGAC GGCGGCCTTC CTCGCCGCCG GCATCGATCC GAAGCGCTCC
ATCGTCTTCA ACCAGTCCCA GGTGCCGCAG CACGCGGAAC TCGCCTGGAT CTTCAACTGC
GTCGCCCGCC TCGGCTGGCT CAACCGCATG ACGCAGTTCA AGGACAAGGC CGGCAAGGAC
CGGGAGAACG CCTCCATCGG TCTCTACGAT TACCCCGTGC TGATGGCCGC CGACATCCTC
GCCTATCGTG CCACGCATGT GCCCGTGGGC GAGGATCAAA AGCAGCACCT CGAACTGACC
CGCGACATCG CGCAGAAGTT CAACAACGAC TTCGCGGGGT CGATCCTGGC GCATGGCCAC
GGCGAACAGT TCTTCCCGAT CACCGAGCCG CTGATCGGTG GGCCGGCGGC GCGCGTGATG
TCCCTACGCG ACGGCACCAA GAAGATGTCG AAGTCGGACC CGTCCGAGTA TTCGCGCATC
GCGCTCACCG ACGACGCCGA CGCCATCGCC CAGAAGGTGC GCAAGGCCAA GACCGATCCG
GAGCCCCTCC CCTCGGAGGT TGCGGGCCTG GCCGGTCGGC CGGAGGCCGA CAACCTCGTC
GGCATCTTCG CGGCGCTGCG CGGCATCACC CGCGACGAAG TGCTGGCGGA TTTCGGCGGA
GCGCAGTTCT CCAGCTTCAA GCCGGCTCTG GTCGATCTCG CCGTCGAAAC GCTGGCGCCG
ATCGGTGCCG AGATGAAGCG GCTCGTCGCC GATCCGGCCT ATATCGATTC CGTTCTCGGA
GACGGCGCGA GCCGGGCCGA GGCGATCGCG GCGCCGACGC TGGATGCGGT CAAGGACATC
GTCGGCTTCG TCCGGCGCGG GCCGGCGCTC AGGGCGGTTT AG
 
Protein sequence
MAAFTELVFS GVQPTGNLHL GNYLGAIKRF VEMQARDAQC LYCVVDLHAI TMWQDPEALK 
GQIREVTAAF LAAGIDPKRS IVFNQSQVPQ HAELAWIFNC VARLGWLNRM TQFKDKAGKD
RENASIGLYD YPVLMAADIL AYRATHVPVG EDQKQHLELT RDIAQKFNND FAGSILAHGH
GEQFFPITEP LIGGPAARVM SLRDGTKKMS KSDPSEYSRI ALTDDADAIA QKVRKAKTDP
EPLPSEVAGL AGRPEADNLV GIFAALRGIT RDEVLADFGG AQFSSFKPAL VDLAVETLAP
IGAEMKRLVA DPAYIDSVLG DGASRAEAIA APTLDAVKDI VGFVRRGPAL RAV