Gene Mext_2233 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_2233 
Symbol 
ID5831927 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp2480695 
End bp2481624 
Gene Length930 bp 
Protein Length309 aa 
Translation table11 
GC content67% 
IMG OID641368032 
Productsulfate adenylyltransferase subunit 2 
Protein accessionYP_001639699 
Protein GI163851656 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0175] 3'-phosphoadenosine 5'-phosphosulfate sulfotransferase (PAPS reductase)/FAD synthetase and related enzymes 
TIGRFAM ID[TIGR02039] sulfate adenylyltransferase, small subunit 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value0.230531 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGCTG CCGTCGCCGC GCCCGCGCGC ACCCGCCTGA CGCATCTCCA GCGTCTCGAG 
GCCGAGAGCA TCCACATCTT CCGGGAGGCC GTCGCCGAGG CCGAGAACCC GGTGATGCTC
TACTCGATCG GCAAGGATTC GTCGGTGCTG CTGCACCTGG CGCTGAAGGC CTTCGCGCCG
GGGCGCCTCC CGTTCCCCCT GATGCACATC GACACGACCT GGAAGTTCCG CGAGATGATC
GCCTTCCGCG ATCGGCGAGC CAAGGAGCTC GGGCTCGAAC TCATCGTGCA CACGAATCAG
GACGGGCTTG CCAAGGGCGT CGGCCCGGTC AGCCACGGCT CGGAAGTGCA TACCGACGTG
ATGAAGACGC AGGCCCTGCG GCAGGCGCTC GACAAGTACA AGTATGACGT GGCCTTCGGC
GGCGCCCGCC GGGACGAGGA GGCCAGCCGC GCCAAGGAGC GCATCGTGAG CCTGCGCAAC
GGCCAGCACC GCTGGGACCC GAAGCGCCAG CGCGCCGAGC CGTGGCACCT CTACAATTTC
AAGAAGCGGC GCGGCGAGAG TTTTCGCGTG TTCCCGCTAT CCAACTGGAC CGAATTGGAT
ATCTGGCTCT ACATCGAGCA GGAAAATATT CCGATCGTCC CGCTCTACTT CGCCGCCGAG
CGCCCGGTGG TGGAGCGCGA CGGCCAGCTC ATCATGGTCG ATGACGAGCG CTTTCCGCTG
GAGCCGGGCG AGACCCCACA ACAGCGGCAG GTCCGGTTCC GCACGCTCGG CTGCTACCCG
CTGACCGGCG CGGTCGAGAG CCCGGCCGCG ACCCTGCCGG AGATCATCGG CGAGACGCTG
GCCGCCCGAA CCTCGGAGCG CCAGGGCCGG GTCATCGACA AGGACGGCGC CGGCGCCATG
GAGCGCAAGA AGCAGGAGGG CTATTTCTGA
 
Protein sequence
MSAAVAAPAR TRLTHLQRLE AESIHIFREA VAEAENPVML YSIGKDSSVL LHLALKAFAP 
GRLPFPLMHI DTTWKFREMI AFRDRRAKEL GLELIVHTNQ DGLAKGVGPV SHGSEVHTDV
MKTQALRQAL DKYKYDVAFG GARRDEEASR AKERIVSLRN GQHRWDPKRQ RAEPWHLYNF
KKRRGESFRV FPLSNWTELD IWLYIEQENI PIVPLYFAAE RPVVERDGQL IMVDDERFPL
EPGETPQQRQ VRFRTLGCYP LTGAVESPAA TLPEIIGETL AARTSERQGR VIDKDGAGAM
ERKKQEGYF