Gene Mext_1117 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_1117 
Symbol 
ID5833351 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp1218711 
End bp1220213 
Gene Length1503 bp 
Protein Length500 aa 
Translation table11 
GC content63% 
IMG OID641366912 
ProductPAS sensor protein 
Protein accessionYP_001638592 
Protein GI163850549 
COG category[T] Signal transduction mechanisms 
COG ID[COG3920] Signal transduction histidine kinase 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones48 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTGATC TGGAGGAAAC AGCTCGTTCT AACGCACTGG GCCGCTACGC GATCTTGGAC 
ACGGCTCCAG AGCCGGCCTT CGACGACCTT GTGCTCCTCG CCTCGCGGAT CTGCGAGGCG
CCTGTCGCAC TCATCAGCCT TGTTGGATCA GATCGGCAGT GGTTCAAGGC ACGCATCGGC
CTTGCCCCTT CCGAAATGCC CATCGAGCAA TCGGTCTGCC GTCATGCTTT GAAGCAGGCC
GGACTCTTCG TGATCCCGGA CCTGACGCTC GATCCGCGGA CCTCCCGCAA CCCTCTCGTC
ACGGGCGAAC CTCACATCCG GTTCTACGCC GGAGCACAAC TGGTGACCTC GGACGGTGTC
GCTTTCGGCA CGATCTGTGT CATCGACACC AAGCCGCGGC CGGAAGGCCT CACGGATAAC
CAGGCCAGCA GCCTTGAGGC GCTCGCCCGG CAGGCCATGT CACAGATGGA GCTCCGTCGC
GTGATCGCCG AGCGCGCTGA GACGGCGCTG CGCAGGAGCG AGGAACGGTT CCAGGCTTTG
GCCAACCTTG TTCCGGGCTT TCTCTGGAGC AGTGATCTTG TCGGTCGAGC AACGTGGTTC
AGCGAGCGCT GGTACGAGTA CTCCGGCCAA TCTGAACCGG AGGCTCTCGG CTATGGCTGG
CAGACGGTGA TCCATCCAGA CGAGCGCGAG TACACAATCA CGGGCTTCCG GGCCGCCATG
GATCAGGAAC GTCCGTATAA CCGTGAATAC CGCATTCGCG GGAAGGATGG CATCTACCGC
TGGTTCATGG TGCGTGCTGA GCCGATACGG GATGCCGCTG GGCAGATCGA TCGCTGCTAC
GGAGCGGTCA CGGACATCCA CGATCTGCAT GAGATGCAGC AGCGTCAAGC GGTGTTGGTG
GATGAGCTGC AGCACCGTAC CCGCAATCTG CTCGCCGTGG TGCGCTCGAT CGCGCAGCAA
ACGATGACCC AGACCGGTTC GACCGAACAG TTTTGCGACC GGTTCAACGA CCGTCTCGCG
GCGCTCTCAC GGGTGCAGGG TCTGCTCTCA CGTTCCGACA AGGAGCCGAT CACCATCCAG
GCGTTGATCC AGATCGAGCT CGACGCGTTC GGGGTTGCCG CGATGCAGGC TCGAGTGGCG
CTGAAGGGCC CGCCGGTTCG CTTGCGCAAG GTCAGCGTGC AGACACTCGC TCTCGCCCTG
CACGAGTTGG CCACCAATGC GCGCAAGTAC GGCTCTCTCG CCAACGAGCA GGGGAGGCTC
TGGGTGAGCT GGGATACCTA CAGGGGAGAG GACGAAGAGC GGCGGCTATC GCTGGTTTGG
CAGGAAGAGG GTATCCGCCG GCCCCAGGAA GGCAGTCCGA TCCGGCGGGG CTATGGGCGT
GACCTGATCG AGAAGGCGCT GCCCTACGCA CTGAAGGCCC GCACCAGCTA CGAACTCAGT
GAGGCTGAGC TGCGCTGTGT CATCGACCTA CCGCTCACCG ATGGCGCGAA GAAACGGCCT
TGA
 
Protein sequence
MPDLEETARS NALGRYAILD TAPEPAFDDL VLLASRICEA PVALISLVGS DRQWFKARIG 
LAPSEMPIEQ SVCRHALKQA GLFVIPDLTL DPRTSRNPLV TGEPHIRFYA GAQLVTSDGV
AFGTICVIDT KPRPEGLTDN QASSLEALAR QAMSQMELRR VIAERAETAL RRSEERFQAL
ANLVPGFLWS SDLVGRATWF SERWYEYSGQ SEPEALGYGW QTVIHPDERE YTITGFRAAM
DQERPYNREY RIRGKDGIYR WFMVRAEPIR DAAGQIDRCY GAVTDIHDLH EMQQRQAVLV
DELQHRTRNL LAVVRSIAQQ TMTQTGSTEQ FCDRFNDRLA ALSRVQGLLS RSDKEPITIQ
ALIQIELDAF GVAAMQARVA LKGPPVRLRK VSVQTLALAL HELATNARKY GSLANEQGRL
WVSWDTYRGE DEERRLSLVW QEEGIRRPQE GSPIRRGYGR DLIEKALPYA LKARTSYELS
EAELRCVIDL PLTDGAKKRP