Gene Mext_3663 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_3663 
Symbol 
ID5832093 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp4050971 
End bp4052083 
Gene Length1113 bp 
Protein Length370 aa 
Translation table11 
GC content68% 
IMG OID641369456 
Productalkanesulfonate monooxygenase 
Protein accessionYP_001641112 
Protein GI163853069 
COG category[C] Energy production and conversion 
COG ID[COG2141] Coenzyme F420-dependent N5,N10-methylene tetrahydromethanopterin reductase and related flavin-dependent oxidoreductases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones38 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCGTTC CGAACTCGTC CTCCGCGTCC GAGCCGATCC GCTTCGCCTA CTGGGTGCCC 
AATGTCTCGG GCGGCCTCGT CATCAGCAAG ATCGCGCAGC GCACGAGCTG GGACGCGGAC
TATAACCGCA AGCTCGCGCA GATCGCGGAG GCGGCGGGCT TCGACTACGC CCTGACCCAG
ATCCGCTTCA CCGCCGGCTA CGGCGCCGAG TACCAGCACG AATCGGTCGC CTTCAGCCAC
GCGCTCGCCG CCGCCACGAC CCGGCTGACG GTGATCGCCG CGATCCTGCC CGGCCCCTGG
AACCCGACGC TCGCGGCCAA GCAGATCGCC ACGATCTCCC AGCTCACGGA AGGACGGATC
GCGATCAACA TCGTCTCGGG CTGGTTCTCC GGCGAGTTCC GGGCGATCGG CGAGCCCTGG
CTCGACCACG ACGAGCGCTA CCGCCGCTCG GAGGAGTTCA TCCGGTCCTT GCGCGGGATC
TGGACGCAGG ACGCCTTCAG CTTCCGCGGC GATTTCTATC GCTACACGAA CTACACCCTG
AAGCCGAAGC CGGGGCCGAA CCTGCCGGAG ATCTTCCAGG GCGGCTCCTC GCGCGCCGCC
CGCGACATGG CCGCCCGCGT CTCCGATTGG TACTTCACCA ACGGCAACAC GCCCGACGGC
GTGCGGGCGC AGGTCGAGGA TCTGCGCGCC AAGGCGCAGG CGAACGGCCA TTCGGTGAAG
GTCGGCGTCA ACGCCTTCGT CATCGCCCGC GAGACGGAGG AGGAGGCCCG CGCCGTCCTT
CAGGAGATCA TCGAGAACGC CGATCCGGAC GCGGTGAAGG CCTTCGGCCA CGAGGTGAAG
AACGCCGGCG CGGCCTCGCC CGAAGGCGAG GGCAACTGGG CGAAATCGAC CTTCGAGGAT
CTCGTCCAGT ACAACGACGG CTTCAAGACC AACCTGATCG GCACGCCCGA CCAGATCGCC
GAGCGCATCC TCGCCCTCAA GGATGCTGGC GTCGATCTCG CCCTGCTCGC CTTCCTGCAC
TTCCAGGAAG AGGTGCAGTA TTTCGGCGAG CACGTGATCC CGCGGGTCCG CGCGCTGGAA
GCCGCCCGCG AGCGCCGGGC CGAGGCGGCC TGA
 
Protein sequence
MSVPNSSSAS EPIRFAYWVP NVSGGLVISK IAQRTSWDAD YNRKLAQIAE AAGFDYALTQ 
IRFTAGYGAE YQHESVAFSH ALAAATTRLT VIAAILPGPW NPTLAAKQIA TISQLTEGRI
AINIVSGWFS GEFRAIGEPW LDHDERYRRS EEFIRSLRGI WTQDAFSFRG DFYRYTNYTL
KPKPGPNLPE IFQGGSSRAA RDMAARVSDW YFTNGNTPDG VRAQVEDLRA KAQANGHSVK
VGVNAFVIAR ETEEEARAVL QEIIENADPD AVKAFGHEVK NAGAASPEGE GNWAKSTFED
LVQYNDGFKT NLIGTPDQIA ERILALKDAG VDLALLAFLH FQEEVQYFGE HVIPRVRALE
AARERRAEAA