Gene Mext_3362 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_3362 
Symbol 
ID5835291 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp3725376 
End bp3727412 
Gene Length2037 bp 
Protein Length678 aa 
Translation table11 
GC content63% 
IMG OID641369161 
ProductPAS sensor protein 
Protein accessionYP_001640819 
Protein GI163852776 
COG category[T] Signal transduction mechanisms 
COG ID[COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones43 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGCGTG GTAGGTTCAG CCGTATGTCA GATGAACAAA TGCGCCTCGG CGCCCTGCAC 
CACCAAGACG TAAGGGCAAC CAAGGCCATC GAAGCGGCCG CAGATCGAGA TGACCGCTAC
CGTCTGCTCG TTGAGGCGAT CGTCGATTAC GCAATCTATA TGCTCGACCC ACGAGGGTAT
ATTGCGAGCT GGAACCCGGG CGCCCAACGC TTCAAAGGTT ATACAACCGA CGAGATCATC
GGAGAGCATT TCTCCCGGTT CTACACCGCT GAGGATCGCG CGACGGATCT GCCGACCCGT
GCTCTGCATA TTGCCGCCAC CGAGGGCCGC TTCGAGCAGG AGGGGTGGCG CGTTCGCAAG
GACGGTTCCC TGATGTGGGC GCACGTGCTG ATTGATCCGA TCCGAAGCGA AGACGGCCAG
CTTGTCGGCT ACGCCAAAAT CACGCGTGAC CTGAGCGAGC GGAAGGCAAC GCAGGAGGCT
TTGCGGCAGA GCGAACAGCG CTTCCGCCTG CTCGTACAGG GCGTTCAGGA CTACGCGATT
TACATGCTCG ACCCGCAGGG CAAGGTTTCG AGCTGGAACC GCGGCGCGCA ACGCTTCAAG
GGCTACACCG ACGCAGAGAT CATCGGGGAG CACTTCTCTC GGTTCTACAC CGATGAGGAC
CGGGCGACAG ATCTGCCGAC CCGTGCCCTG CGGACGGCTG CGGACGAGGG GCGCTTCGAG
GCGGAGGGAT GGCGCGTGCG CAAGGACGGC ACACGGTTCT GGGCGCACGT CGTCATCGAC
GCAATCCGCG GCGATCACGG TGAACTGGTC GGGTTCGCCA AGATCACCCG GGACATCACC
GAGCGGCGCA ACGTTCAGCA GGAGCTGGAA GCGACCCGTG CACGCTTCAT CCAGTCCCAG
AAGATGGAGG CAATCGGCCA GCTCACCGGC GGCGTTGCGC ACGACTTCAA CAATCTACTG
GCGGTCGTGC TCGGCAACCT GAGCCTCGCT CGCAAGCGGC TGCCGAACGA CCCGAAGCTG
CGGCAATTGA TCGAGAACTC GATCCAGGCT GCCGAACGGG GAGCGGCACT GACCAAGCGC
ATGCTTGCCT TCGCGCGCCG GCAGGAGCTG GAGACCGGCC CCATCAACGT GCCGGACCTC
GTGCGCGGCA TGGCCGAACT GCTGCAACGC TCGATCAGCT CCACCATCCT GGTCAACACG
CAGTTCCCGC TGCAACTGCC GCTGGCCTTT GCAGACGCAA GTCAGCTTGA GCTAGCGTTG
CTCAATCTGA CTGTGAACGC GCGCGACGCG ATGCCGGAAA GTGGAACGAT CACGATTGCC
GCCCGGGAGG AGCGGGTCGG CACGGACGAG ATCGCGGGCT TGGCGCCAGG ACACTACGTT
TGCCTGTCTG TGACGGACAC GGGCACAGGG ATGGACGCGG AGACACTGGC GAAGGCGACA
GAACCGTTCT TCACCACCAA GGGCGTTGGT AAGGGCACGG GGCTGGGTCT CTCCATGATC
CACGGCTTCG CCGAACAGTC CGGCGGCCGT CTGGTGCTCA AGAGCAGCCT GGGCAGCGGT
ACCACCGCAG AACTCTGGCT GCCTGCCGCC GAGGGCGATC ACCTTCGCAA AGACGACCAC
GCGCCGGCGC CGGAGTTACC GTCCCTGCAT GCTCTCACGG TGCTGGTGGT TGATGACGAC
CCGCTGGTGC TGATGAACAC CGGAGCCATG TTGGAAGACC TTGGCCATGA GGTGCTTGAG
GCAACCTCGG GTGAGCAAGC GCTCCGCGTT CTACGACGCG CCGAGAGCAT TGATCTGGTG
ATCACGGATC AGATGATGCC GGGCATGACC GGCGTGCAGC TGATTGACGC TCTCAAGGCC
GAACGCGCGG ATCTGCCGGT GATCCTGGCA AGTGGCTATG CTGAGTTGCC GGAGGATCGG
CTCACGGGCA TCGTGCGCCT CGGCAAGCCG TTCGAACAGG TCGATCTCGC ACGCGCACTT
GTGACGAGCC TCCGGCCAGA GGCGGAGATT GTGCCGTTCC GGCCCAAGCG AAGCTGA
 
Protein sequence
MMRGRFSRMS DEQMRLGALH HQDVRATKAI EAAADRDDRY RLLVEAIVDY AIYMLDPRGY 
IASWNPGAQR FKGYTTDEII GEHFSRFYTA EDRATDLPTR ALHIAATEGR FEQEGWRVRK
DGSLMWAHVL IDPIRSEDGQ LVGYAKITRD LSERKATQEA LRQSEQRFRL LVQGVQDYAI
YMLDPQGKVS SWNRGAQRFK GYTDAEIIGE HFSRFYTDED RATDLPTRAL RTAADEGRFE
AEGWRVRKDG TRFWAHVVID AIRGDHGELV GFAKITRDIT ERRNVQQELE ATRARFIQSQ
KMEAIGQLTG GVAHDFNNLL AVVLGNLSLA RKRLPNDPKL RQLIENSIQA AERGAALTKR
MLAFARRQEL ETGPINVPDL VRGMAELLQR SISSTILVNT QFPLQLPLAF ADASQLELAL
LNLTVNARDA MPESGTITIA AREERVGTDE IAGLAPGHYV CLSVTDTGTG MDAETLAKAT
EPFFTTKGVG KGTGLGLSMI HGFAEQSGGR LVLKSSLGSG TTAELWLPAA EGDHLRKDDH
APAPELPSLH ALTVLVVDDD PLVLMNTGAM LEDLGHEVLE ATSGEQALRV LRRAESIDLV
ITDQMMPGMT GVQLIDALKA ERADLPVILA SGYAELPEDR LTGIVRLGKP FEQVDLARAL
VTSLRPEAEI VPFRPKRS