Gene Mext_3097 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_3097 
Symbol 
ID5831118 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp3447999 
End bp3449885 
Gene Length1887 bp 
Protein Length628 aa 
Translation table11 
GC content71% 
IMG OID641368897 
Producthypothetical protein 
Protein accessionYP_001640556 
Protein GI163852513 
COG category[R] General function prediction only 
COG ID[COG3211] Predicted phosphatase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones38 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCGACA TGCCCGACGC TCCGACCCTC GATGCGGTGA TCGCGCAGCG CTTCTCTCGG 
CGCGACCTGA TGCGCGGCTC GCTCGCCACC GCACTCGCCG CCGGACTCGC GCCGCAGACC
GTCGCGGCCC CCGCCGATTC CTCCGCCTTC GACTTCCCCG AACTCGCCGC CGGAATCGAT
GAGCACCTGC ACGTGGCCGA GGGCTACGAT GCGAAGCCGC TGCTGCGCTG GGGCGACCCG
CTGTTTCTCG ACGCGCCGGC CTTCGATCCG CAAATGCAGA GCGCGCGGGC TCAGGAGCGC
CAGTTCGGCT ACAACAACGA CTTCGTCGGC TTCATCCCGC TCGACGCGGC GAGCCAACGT
GGTCTCCTCG TGGTCAACCA CGAATACACC AACGCCGAGC TGATGTTTCC CGGGCTCGCG
GCCGCCGACC GCAAGGCCGT GATCGCCGCC CTGAGCCCGG AACAGGTTGC GATCGAGATG
GCCGCCCATG GCGGCTCGGT GGTCGAGATC GTCCGCGAGG CGGAAGGCTG GCGCCCGGTG
ATCGGCTCGC CCTACACGCG GCGCATCACG GCGGAGACGC CGATCGCCCT CACCGGCCCC
GCCGCCGGCC ATCCCCGCCT CATGACCGAG GCCGACCCGA CCGGCCGGCG CGTGCTCGGG
ATGATCAACA ACTGCTCGGG CGGTGTCACG CCCTGGGGCA CGTGGCTCTC AGGCGAGGAG
AACATCAACT ACTACTTCTC GGGAGCTCTC CCTGCCGGAC ATGCGGAGGC CGGCAACGCG
AAGGCGATCG GCCTGAGCAG CCCGCAATAT GCCTGGAGCC GCTTCCATCC GCGTTTCGAT
CTCGCACAGT CGCCGAACGA GCCGAACCGG TTCGGCTGGG TGGTCGAGAT CGATCCGTTC
GATCCGGCCT CGACGCCGAA GAAGCGCACG GCGCTCGGCC GGTTCAAGCA TGAGGGTGCG
GCGGGCGCGC GTTCGCTCGA CGGACGCTAC GTCGTCTATC TCGGCGACGA CGAGCGCTTC
CAGCACGTCT ACCGCTTCGT CAGCGAGGGC CGGGTGCAGG CCGAGCGCGC GGCCAACGCC
GACCTTCTCG ATTCTGGCAC GCTCAGCGTC GCCCGGTTCG AGCCCGACGG CACCGGGCGC
TGGCTGCCGC TCGTGCACGG CGCGAACGGG CTCGATGCGG GCAACGGTTT CGCCAGTCAG
GCTGACGTGC TCATCGAAGC CCGCCGCGCC GCCAAGAGCC TCGGCGCCAC GCCGATGGAT
CGGCCCGAGG ACATCGAGGC GAACCCGCGC ACCGGCCGCG TCTACGTGAT GCTGACCAAC
AACGGGAAGC GCACCGCCGA TCAAGAAGAG CCCGCCAACC CGCGCGGCCC CAACGCCTTC
GGCCACGTCA TCGAGATCAC CCCCGACGGC ACCGACCACG CCGCCGAGAC CTTCCGCTGG
GAGGTGCTGG TGCGCTGCGG TGACCCGGCC AAGCCCGAGG TGAAGGCGAG CTTTTCCGCG
CTCACCACGG AGAACGGGTG GTTCGGCATG CCCGACAACT GCACCGTCGA CGGGCGCGGC
CGCCTCTGGA TCGCCACCGA CGGCAACAAC CGCCGCGCCA CCGGCCGGGC CGACGGCATC
TGGGCGGTGG AAACGGAAGG CCCGCGCCGG GGCACCGCGC GCCACTTCCT GCGGGTGCCG
GTGGGCGCCG AGATGTGCGG CCCCTGCTTC ACCCCCGACG ACGAGACCTT CTTCGTCGCC
GTCCAGCATC CCGGCGAGCC CGACGAGGAG GGCGCCCTCG GCTCCTACGA GAAGCCCTCG
ACCCGCTGGC CGGATTTTTC GCCCGATCTG CCGCCGCGAC CGTCCGTCGT GGTGGTGCGG
CGGACGGGGG GCGGACGGAT CGGCTGA
 
Protein sequence
MSDMPDAPTL DAVIAQRFSR RDLMRGSLAT ALAAGLAPQT VAAPADSSAF DFPELAAGID 
EHLHVAEGYD AKPLLRWGDP LFLDAPAFDP QMQSARAQER QFGYNNDFVG FIPLDAASQR
GLLVVNHEYT NAELMFPGLA AADRKAVIAA LSPEQVAIEM AAHGGSVVEI VREAEGWRPV
IGSPYTRRIT AETPIALTGP AAGHPRLMTE ADPTGRRVLG MINNCSGGVT PWGTWLSGEE
NINYYFSGAL PAGHAEAGNA KAIGLSSPQY AWSRFHPRFD LAQSPNEPNR FGWVVEIDPF
DPASTPKKRT ALGRFKHEGA AGARSLDGRY VVYLGDDERF QHVYRFVSEG RVQAERAANA
DLLDSGTLSV ARFEPDGTGR WLPLVHGANG LDAGNGFASQ ADVLIEARRA AKSLGATPMD
RPEDIEANPR TGRVYVMLTN NGKRTADQEE PANPRGPNAF GHVIEITPDG TDHAAETFRW
EVLVRCGDPA KPEVKASFSA LTTENGWFGM PDNCTVDGRG RLWIATDGNN RRATGRADGI
WAVETEGPRR GTARHFLRVP VGAEMCGPCF TPDDETFFVA VQHPGEPDEE GALGSYEKPS
TRWPDFSPDL PPRPSVVVVR RTGGGRIG