Gene Mext_4290 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_4290 
Symbol 
ID5834926 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp4775937 
End bp4776983 
Gene Length1047 bp 
Protein Length348 aa 
Translation table11 
GC content68% 
IMG OID641370081 
Producthypothetical protein 
Protein accessionYP_001641730 
Protein GI163853687 
COG category[S] Function unknown 
COG ID[COG2326] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones38 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGACCT CACAATCGGC TCGTGAGCAG GTAATCTTCG GCATGGCGCG CAAGAACGGC 
AAGGACGGCA AGAGCGCCGG GAGTGAAAAA AGCCTCGAGA GCGACAAAAC GACGGAAGCA
CAACCGCAGG CCGCGTGGCC CGACCATCCG CCTTCCTTCG CCGGCTGGGC CCGCGCGGCG
ATCGCGGGCA CGGGCACCGC ACCGAGCCTG TCCCCGCATC TCCACCCGGT CCTGCCGCCG
GCCGCGCCCG GCATCGTCAC GGTCGAACCC GGCCAGAGCG TCAACCTCGC CGCGATCGAT
CCCGACGCCA GCGGCGGTCT CGAGAAGGCG GCGGCCAAGA CCGAACTCGA CGCGCAGCGC
GTGCGCATCC GGGCGCTGCA GGAGAAGCTC TACGCCGAGC ATCGCCGCTC CCTGCTCGTG
GTGTTCCAGG CGATCGATAC CGGCGGCAAG GACGGCACCA TCCGCAACGT GCTGGAGGGG
GTGAACCCGC AGGGCTGCCG GGTCTGGTCG TTCAAGGTGC CGAGCACGGA GGAACTCGAT
CAGGATTTCC TCTGGCGCTA CCACCTGCGC ACGCCCGGCC GCGGCCTGAT CGGCGTGTTC
AACCGCAGCC ATTACGAGGA CGTGCTCGTG GTGCGGGTGA AGGGCCTCGT GCCGGAGGAG
ACGTGGCGCG AGCGCTACGG GATCATCAAC GATTTCGAGC GGCTGCTGAC GCTCTCGGGC
ACGGTGATCC TCAAGTTCTT CCTGCACATC TCCAAGGACG AGCAGAAGGA GCGCTTGGAG
GCCCGCCTCG CCGATCCGGA GAAGCACTGG AAGTTCGACC CGGCCGACCT CGTGGAGCGC
AAGAGCTGGG ACGCCTACCA GACCGCCTTC AACGACGCGC TCGCCCGCTG CTCGACGCCC
TACGCCCCCT GGCACGTGGT GCCGGCCAAC CGCAAATGGG CCCGTAACGT CATGGTCGCC
CGCACCATCG CCGACACGCT GGAAGCGATG GACCCGCGCT TCCCCGAGCC GCGCAAGGGG
CTGGACGGTA TCAAGGTGCC GGATTGA
 
Protein sequence
MATSQSAREQ VIFGMARKNG KDGKSAGSEK SLESDKTTEA QPQAAWPDHP PSFAGWARAA 
IAGTGTAPSL SPHLHPVLPP AAPGIVTVEP GQSVNLAAID PDASGGLEKA AAKTELDAQR
VRIRALQEKL YAEHRRSLLV VFQAIDTGGK DGTIRNVLEG VNPQGCRVWS FKVPSTEELD
QDFLWRYHLR TPGRGLIGVF NRSHYEDVLV VRVKGLVPEE TWRERYGIIN DFERLLTLSG
TVILKFFLHI SKDEQKERLE ARLADPEKHW KFDPADLVER KSWDAYQTAF NDALARCSTP
YAPWHVVPAN RKWARNVMVA RTIADTLEAM DPRFPEPRKG LDGIKVPD