Gene Mext_4875 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_4875 
Symbol 
ID5833374 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp5447189 
End bp5449087 
Gene Length1899 bp 
Protein Length632 aa 
Translation table11 
GC content69% 
IMG OID641370673 
ProductPAS sensor protein 
Protein accessionYP_001642314 
Protein GI163854271 
COG category[T] Signal transduction mechanisms 
COG ID[COG3300] MHYT domain (predicted integral membrane sensor domain)
[COG4251] Bacteriophytochrome (light-regulated signal transduction histidine kinase) 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.152822 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones41 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTTTCCA CCGGCTACAA TCCGGCTCTC GTGGCGCTTT CCCTCGCCAT CGCCGTGCTC 
GCCTCCTACA CGGCGCTCGA TCTCGGCGGC CGCGTGCGGG CCGCCGCATC CGGACTGGGC
TGGGCTTGGC TTCTGGGTGC CGCCCTGGCG ATGGGGGGCG GCATCTGGTC GATGCACTTC
GTCGGGATGC TGGCCTTCGA GATGGGCCTG CCCGCCGCCT ACGACCTCGG CCACACCCTC
CTGTCCCTGC TCATCGCCAT TGGGACGACC GGCGCGGCCC TGGCCTGGGT CGGCCGTCGG
CGGGCCGGCC GCTGGGATGT CCTCGTCGCT GGCCCCGTGA TGGGGATCGG CGTGGCGGCG
ATGCACTATA CCGGGATGGC CGCGATGCGG ATGCCCGGCC ATCTCGCCTA CAGCGTGCCG
GTCGTCGCGC TCTCGGTCGG GATCGCCATC ACCGCGGCGA CCATCGCCCT GTGGCTGACC
TTTCGGCCGA CGAGCGTCTG GCAGCGCTTC GCCGCCGCCC TGATCATGGG CGTGGCCGTG
GCCGGGATGC ACTATACCGG CATGGCGGCC GCGACGATCA CCGCCGAGGA AGCCGGCGCC
CATTCCGCTC ATCTCGCCGC GACGAGCGTG GATCAGCAGA ATCTCGCCCT CTACGTCGCC
GGGGTGACCT TCGTGATCCT GTTCCTCGCG ATGCTCGCCT CCGCTTTCGA TCAGCAGAGG
ATCCAGGGCG ATTTGCGGGC GAGCGAGGAG CGTTTCCGGG CCGCCGTACA GGCCGTGCGC
GGCGTGCTGT GGACCAACGA CCCGAAGGGG CGGATGACGG GCGAACAACC CGGCTGGACC
GCCCTCACGG GGCAAACCCG CGCCGAGTAC GAGGGCTTCG GCTGGGCCGA CGCCGTGCAT
CCGGAGGACC GGCAAGCGAG CGTCGAGGCT TGGAATGCGA CTGTCGCCGC CCGCAGCACC
TTTCTGCACG AGCATCGCGT GCGCGCCCGG GACGGCCTGT GGCGACATTT CTCGATCCGC
GCAATCCCCG TGCTCGACCC GCACGGCGCC ATCCGCGAAT GGGTCGGCGT CCATACCGAC
ATCACGGAGC AGCGCGAGGC CGAGGCCGAA TTGCGGGAGT CGAACGACGA GATCCAGCGC
TACGCCTACA TCGTCAGCCA CGACCTGCGC GCGCCGCTCG TCAACGTCAT GGGTTTCACG
AGCGAGTTGG AGGCGGTGCG CCAGGAATTG CGTACGGTGC TGCGTGACCA TCCGCAGGGC
GCGCGGATCG ATGACGACAT GACCGAGGCG CTGAGCTTCA TCCAGGCGGC GATCGTCAAG
ATGGAGCGGT TGATCGCAGC CATCCTGAAG CTCTCGCGAG AGGGGCGGCG CCGGTTCAGC
CCGGAGCCGC TCGCGATGAC GCCCGTCATC CGCGGCATCG CCGACGCGCA GCGTCATCAG
GCCGGGCGCA AGGGTGTGAC GGTGACGGTG GCCGACGATC TTCCCTCGAT CGTCGCCGAC
CGGCTCGCCG TGGAGCAGGT CTTCGGCAAC CTGATCGACA ACGCTCTCAA ATATCTCGAT
CCGGCCCGTC CCGGAACGAT CGAGGTCACG GCCCGGCCTG CACCCGGCAA CCGGATCCGT
TTTGACGTCT CCGATACCGG GCGAGGTATC GCACCGCAGG ATCATGGCCG CATCTTCGAG
CTGTTCCGGC GCTCCGGCAC GCAGGATCAG CCGGGGGAGG GGATCGGGCT CGCCAGCGTG
AAAGCTTTGG TGCGCGCGCT CGGCGGCCGG ATCGAGGTTT CGTCACAACC CGGTGTCGGC
ACAACCTTTA TCGTGACGCT GCCGCGCGAG CCGGTTACAG GCCGAGGCGG GGCCGCGACA
CTCGATCCTC CCGATACGAT GACCCTGGCC GCGGAGTAG
 
Protein sequence
MVSTGYNPAL VALSLAIAVL ASYTALDLGG RVRAAASGLG WAWLLGAALA MGGGIWSMHF 
VGMLAFEMGL PAAYDLGHTL LSLLIAIGTT GAALAWVGRR RAGRWDVLVA GPVMGIGVAA
MHYTGMAAMR MPGHLAYSVP VVALSVGIAI TAATIALWLT FRPTSVWQRF AAALIMGVAV
AGMHYTGMAA ATITAEEAGA HSAHLAATSV DQQNLALYVA GVTFVILFLA MLASAFDQQR
IQGDLRASEE RFRAAVQAVR GVLWTNDPKG RMTGEQPGWT ALTGQTRAEY EGFGWADAVH
PEDRQASVEA WNATVAARST FLHEHRVRAR DGLWRHFSIR AIPVLDPHGA IREWVGVHTD
ITEQREAEAE LRESNDEIQR YAYIVSHDLR APLVNVMGFT SELEAVRQEL RTVLRDHPQG
ARIDDDMTEA LSFIQAAIVK MERLIAAILK LSREGRRRFS PEPLAMTPVI RGIADAQRHQ
AGRKGVTVTV ADDLPSIVAD RLAVEQVFGN LIDNALKYLD PARPGTIEVT ARPAPGNRIR
FDVSDTGRGI APQDHGRIFE LFRRSGTQDQ PGEGIGLASV KALVRALGGR IEVSSQPGVG
TTFIVTLPRE PVTGRGGAAT LDPPDTMTLA AE