Gene Mext_0020 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_0020 
Symbol 
ID5831663 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp22475 
End bp23674 
Gene Length1200 bp 
Protein Length399 aa 
Translation table11 
GC content69% 
IMG OID641365805 
Producthypothetical protein 
Protein accessionYP_001637520 
Protein GI163849477 
COG category[S] Function unknown 
COG ID[COG4320] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.117767 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones45 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAACGGAG CCTATGGTCC TGAGGAACGT GCTTTGGTGC TGGAGCGGCA ACGCACCCTT 
AAAATGGCCC AATCGGCCCA CGCCTATGTC CGGGGCAATA CCCTGAAGTT CTACGAGTGG
CTCGAGGGTC TCACCCGCGG CACCCTGCCG GAGGGACCGC CGGTCTGGAT CTGCGGGGAT
TGCCATCTCG GAAATCTCGG GCCCCTGGCC GATGCCGACG GCCGCGTCGA CATCCAGATT
CGCGATCTCG ACCAGACCGT CATCGGCAAC CCAACGCATG ACCTCGTGCG CCTCGGGCTG
TCGCTCGCCA GCGCCGCGCG CGGCTCCGAC CTCCCCGGTG TGGTGACGGC GCGAATGCTG
GAACAGATGC TTCTGGGCTA CGCCGCGGGG CTGGGACAGA ACGAGACGAA CCGGGAGCCC
TCCGAACCGG ACGCGGTGCG CTCGGTCCGC CGCCGGGCCC TCGGGCGTCA CTGGAAGCAC
CTCGCGCGGG AGCGCCTGAA GGGCGTGGAG CCGGCGATTC CGCTCGGACG CAAGTTCTGG
AAGCTCGACA CGGAGGAGCG CGAGGCCCTC GATGGGGTCT TTCAGGAAGA CGCGGTGCGC
CACCTCGTGC TGGCCCTGAA CGGGCGCAGT GACGAGGCCG AGATCCGCCT GATCGACGCA
GCCTACTGGA TGAAGGGATG CAGCTCGCTC GGCTTCCTGC GCTACGCCGC GCTCGTCGGC
ATCACCGAGC CCGGAAACAA GCGCCGGCTC GCGCTGGTGG ACTTGAAGGA GGCGGTGGCG
CCGGCCGCGC CGACCGCTCC CGGTGTGGCG ATGCCCTCCG AGCCGGCTGA ACGGGTGGTG
GCCGGCGCGC GGGCCCTGTC GCCGAATCTG GGCGAACGCA TGCTGCCGGT TCGGTTGCTG
GGCAAGTCGG CCGTGATGCG CGAACTCGCG CCACAGGACC TGAAGCTCGA CGTCGATCAA
TTCGGCCGCG AGGAGGCAGT CCGCGCCGCG CATTACCTCG CCCATGTCGT CGGAAAGGCG
CATGGCCGGC AGATGGACGC CGAGACCCGC GCGGCATGGC GGACCGAGAT CACGCGTGGC
AACGACGTGG ACGAGGGGGC GCCCTCCTGG CTGTGGTCCA GCGTGGTCGA ACTCGCCGGG
CGGCACGAGG TCGGATACCT CCAGCATTGC CGCCGCTACG TCGGCCAGGA GGCGGCCTGA
 
Protein sequence
MNGAYGPEER ALVLERQRTL KMAQSAHAYV RGNTLKFYEW LEGLTRGTLP EGPPVWICGD 
CHLGNLGPLA DADGRVDIQI RDLDQTVIGN PTHDLVRLGL SLASAARGSD LPGVVTARML
EQMLLGYAAG LGQNETNREP SEPDAVRSVR RRALGRHWKH LARERLKGVE PAIPLGRKFW
KLDTEEREAL DGVFQEDAVR HLVLALNGRS DEAEIRLIDA AYWMKGCSSL GFLRYAALVG
ITEPGNKRRL ALVDLKEAVA PAAPTAPGVA MPSEPAERVV AGARALSPNL GERMLPVRLL
GKSAVMRELA PQDLKLDVDQ FGREEAVRAA HYLAHVVGKA HGRQMDAETR AAWRTEITRG
NDVDEGAPSW LWSSVVELAG RHEVGYLQHC RRYVGQEAA