Gene Mext_4045 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_4045 
Symbol 
ID5834515 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp4501422 
End bp4502546 
Gene Length1125 bp 
Protein Length374 aa 
Translation table11 
GC content72% 
IMG OID641369836 
Productglutamate--cysteine ligase GCS2 
Protein accessionYP_001641486 
Protein GI163853443 
COG category[S] Function unknown 
COG ID[COG2170] Uncharacterized conserved protein 
TIGRFAM ID[TIGR02050] uncharacterized enzyme 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.216393 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value0.832603 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGCCC ACCCCTACCG GTTCGGGATC GAGGAAGAGT ATTTCCTCGC CGATGCCGAG 
ACCCGCGGCA CGCCGCGCCG CTCGGTCAAG CCGTTCCATC AGGCGGCCGG CGAGCGGCTG
CCCGAGATCG GGCGCGAGCT GCTGCAATGC CAGGTCGAGG TCTGCACCCC GCCTTCCACG
GCGTTCCCGG CCGCCTACGC CGCGCTGCGG GAGCAGCGGC AGGCCTTGGC GGAGCTGGGC
CGCGAGCACG GGCTTCTGGT CTTTGCCGCC GGCACCCACC CCATCGCCGA CTGGGCGCGG
CAATTGCCGA CGAAGGGCGA CCGCTACCGC GGCATCCTGC GCGATGTCGG CCTCGCCGGG
CGGCGCAGCC TGATCTGCGG CATGCATGTC CATGTCGAGG TCGCCGACCC CGACCGGCGC
GTCGCGCTGA TGGACCGGCT GCTGCCGTTC CAGCCGCTGC TATTCGCCCT CTCCGTCTCC
TCGCCGTTCT GGCAGGGCAA GCCGACGGGG CTCGCCGCCT ACCGCCTCAG CGCCTTCGGC
GAGCTGCCGC GCACCGGCCT GCCCGACCTG ATGGGCAACG CGGCGGCCTA CGAGCGCTAC
GTGCGCATCA TGACCAATGC CGGCTCGATC CAGGATGCGA GCTTCCTGTG GTGGTCCCTG
CGGCCCTCGA TCAAGTTCCC GACGCTCGAA CTGCGCATCG CCGATTCCTG CACCCGGCTC
GCCGACGCCA TCGTGGTGGC GGCCCTGTTC CGCTGCCTCG TGCGGCTGAT CGAGCGCCGG
CCCGACATCA ATGCGGGGCT CACCGGCGTC TCGCGGGCGA TCGCGGCGGA GAACCTGTGG
CGGGCCCAGC GCAGCGGGAT CGAGGCCGAG CTGATCGACG AGGCGTCCGA GCAGGCCTTC
CCCTTCGACG CCGCCCTCGA CACGCTGCTC TCGCTGATCG CGGAGGATGC CGAGGCGCTG
GGCTGTACCG AAGAGGTGGC CGACGCCCGC CGCATCCTGC GCGAGGGCAC CAGCGCCGAC
CGGCAGATCG CCGCGTTCGA GGGCACCCGC GGCGGCGACA CCAGCAACCG CCAGGCCCTC
GACGCGGTGG TGGACTGGCT CGCCGAGACC ACGCGGGACG GGTGA
 
Protein sequence
MSAHPYRFGI EEEYFLADAE TRGTPRRSVK PFHQAAGERL PEIGRELLQC QVEVCTPPST 
AFPAAYAALR EQRQALAELG REHGLLVFAA GTHPIADWAR QLPTKGDRYR GILRDVGLAG
RRSLICGMHV HVEVADPDRR VALMDRLLPF QPLLFALSVS SPFWQGKPTG LAAYRLSAFG
ELPRTGLPDL MGNAAAYERY VRIMTNAGSI QDASFLWWSL RPSIKFPTLE LRIADSCTRL
ADAIVVAALF RCLVRLIERR PDINAGLTGV SRAIAAENLW RAQRSGIEAE LIDEASEQAF
PFDAALDTLL SLIAEDAEAL GCTEEVADAR RILREGTSAD RQIAAFEGTR GGDTSNRQAL
DAVVDWLAET TRDG