Gene Mext_3897 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_3897 
Symbol 
ID5834885 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp4331459 
End bp4334251 
Gene Length2793 bp 
Protein Length930 aa 
Translation table11 
GC content73% 
IMG OID641369688 
ProductPAS sensor protein 
Protein accessionYP_001641339 
Protein GI163853296 
COG category[T] Signal transduction mechanisms 
COG ID[COG2203] FOG: GAF domain
[COG3920] Signal transduction histidine kinase 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value0.240133 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCGAC CGCCCGACGA TTGTCCAGCA TCCGGTGAGG CCGGCGCAGA GGCCGAGAGG 
CTGGCCGCGC TGGCCTGCTA CGGCATCCTC GACACGCCCG CCGAGGCCGC GTTCGACGAT
GCCGTGGCCT TGGCCGCCCA GCTCTGCGCC ACGCCGACCG CCCTCGTCAG CCTGGTGACC
GGCGACCGGC AATGGTTCAA GGCGCGGCTC GGCTTCGCCC CCGGCGAGAC CGGGCTCGAC
CGCTCGGTCT GCGTTCACGC CCTGGCCGGG CGGGGCTTGC TGGTCATCCC CGATCTCGCG
GCCGACCCGC GCACCCGCAC CAACCCGCTG GTCACGGGAG AGCCCGGCAT CCGCTTCTAT
GCCGGCGCCC CCCTGGTGAC GCCTGAGGAT CGGGCGATCG GCACCCTGTG CGTGCTCGAC
ACCAAGCCCC GGCCGGAGGG TCTGAGCGCG GCCCAAGGGG CCGGTCTCGA AGCGCTCGCC
CGCACGGTGA TGACGCAGCT CGAACTGCGC CGGGGCATCG CGGCCCGGAA GACCGAGGCC
GCGGCCCTGG CCGACAGCGA GTTCCGTCTG CGGCTCGCGA TCGAAGCGGC CGGTGCCGGC
ATCTTCGACT ACGACCTCGT GGCGGGCACC CTCGCCTGGG ACGGGCGGAC CCGCGCCCTG
TTCGGGGTCG GGCCGGACGA GGCCGTGAGC TATGCCGGCA CGTTCCTCGC CCGCCTCCAC
CCCGAGGACC GGGCGCGCAC CGATGCCGCC GTGCAGGCGG CCCTCGACCC GGCCGGGCCC
GGCCTGTTCG ACGCCACCTA CCGCACCGTG ACGGCGGACG GCACGGTGAT CGCCTGGGTC
GCGGCCCGCG GCACCCTCGT CGTCGAAACG GATGAGGGGA TCAAACGGGC GCGGCGCTTC
GTCGGCACGG TGCGCGACGT CACCGCCGAG CGCACGGCGC AGGTCGCCGT CGCCGCCACC
GAGGAGCGCT ACCGCCTCGT CACCCGCGCC ACCAACGACG CGATCTGGGA CTGGGACCTC
GCCGCCGACC ACGTGCTGTG GAACGAGGCC CTTCAGGCCG CCTATGGCTG GGCGCCGGAG
ACGGTCGAGC CGACGGGCCG GTGGTGGCTC GACCACATCC ACGCCGAGGA TCGCCCCCGC
GCCGAGGCGG GTATCCGCCG CGTCATCGGC GGCGGCGGCC ACGATTGGCA CCACGAATAC
CGCTTCTGCC GCGCCGACGG CGCCTACGCC GACGTGCTCG ATCGCGGTTC GATGGTGCGC
GGGGCCGACG GCAGGCCGCT GCGCATGATC GGCGCCATGC TGGATCTGAC CGAGCGCAAC
CGCGTCGCCG CCCAGCTCCG GGCGGTGGTC GAAGGCGCGA ATATCGGCAT CGTGCAGATC
GATCCGCGCA CCATGATCGC GCTGGAGGCC AACCCGAAGC TCTGCGCGAT CTGGGGGGCG
GAGGAATCCG ACATCGTCGG GCATTCCATT GCCAAGTGGA CGCCGGAAGC GGATGCGGCG
GAGCGCGACC AGCTCCACCG CCGGCTCGCC GCGGGCGAGA TCGTGCGCGA GACCCTGGAG
AAGCGCTACC GCCGCAAGGA CGGGCGCCTG ATCTGGGGCC GGGTCAACCT CGTCTCGCAG
GCCCGCGGCG AGGCGCTCCA GGCCACGGCG ATGATCGAGG ACATCACCGC GGAGAAGGCG
ACCGAGGCAC GCCAGACGGC GCTGATCGAA CTCGGTGACA CCTTGCGCGA CGCCGCCGGC
CCCGCCGAGA TCCGTGGGAT CGCGGCCCGG ATCCTTAGGC GCAGCCTCGA CCTCTCGGAG
GCGGGCTACG CGGCCATCGA TGCCACCGTC GGCGGCTTTG CGATCGGGCG CGCGAGACCG
GACGGGACGA TGAGCCCCGC GCCGTTTCCG GCCATGCTCG CGCGGCTGCG CCGCGGCGAG
ATCCTGGCCG TGCCCGATCT GACCGCCGAA CCGGACCTCG CGCCGGATGC AGGCGGCTAC
GCGGCGGCCG GCGCCCGTGC GCTGATCGGC GTGCCCTTGA CCCGGCGGGG CGTCCTCGTC
GGCCTCGTCT ATGCCCACGC CGCCGAACCC CGGACCTGGG ACGCGGGCGA GGTCGATTTC
GTCCGCGAGG TGGCCGGACG GATCTCCGTG GCGCTCGCCC GCATCCAGGC CGAGGAGCAG
CAGCGCTTCC TCAACCGCGA ACTGAGCCAC CGGTTGAAGA ACACCCTGAC CATGGCCCAG
GCCATCGCCT CGCAGACGCT GCGCAACGCC ACCGACATCG CCTCGGTGAA GGAGGCGCTG
GTGGCCAGGC TGGTGGCGCT CGGCAAGGCG CACGACATCC TGCTCTCGGG CGAGGGCGAG
GGGGCGGCGC TGCAGGCGGT GATCGCCGGC GCGCTCACCA TCCACGACGA CGGCGAGCCC
GGCCGCATCC ACCTGTCCGG CCCAGCCCTG GAGGTCGGGC CGAAAGCCGC GCTGTCGCTG
GCGCTGATGA TCCACGAACT CGCCACCAAT GCCGCCAAGT ACGGCGCCTT CTCGGTGCCG
GGCGGGCGCG TCGGGGTGAA CTGGCACGTC GCGCGGGCGC GTTGGCCGGA GGACGCGGAG
GATGCGGGGG AGGCCGAGCC GGTCATCACG ATGACCTGGG CCGAGACCGG CGGACCGCCG
GTCGCCGCGC CCACCCGCAA GGGCTTCGGC TCGCGGCTGA TCGAGCGCGG GTTCTCCGGG
GCGGTCGGCG GCGAGACGCA GATGATCTAC GCCCGAGAAG GGGTGACGTG CCGGATCAGG
GCGCCCCTGA AGGGTCTTCT CGAAAAAGAA TAG
 
Protein sequence
MSRPPDDCPA SGEAGAEAER LAALACYGIL DTPAEAAFDD AVALAAQLCA TPTALVSLVT 
GDRQWFKARL GFAPGETGLD RSVCVHALAG RGLLVIPDLA ADPRTRTNPL VTGEPGIRFY
AGAPLVTPED RAIGTLCVLD TKPRPEGLSA AQGAGLEALA RTVMTQLELR RGIAARKTEA
AALADSEFRL RLAIEAAGAG IFDYDLVAGT LAWDGRTRAL FGVGPDEAVS YAGTFLARLH
PEDRARTDAA VQAALDPAGP GLFDATYRTV TADGTVIAWV AARGTLVVET DEGIKRARRF
VGTVRDVTAE RTAQVAVAAT EERYRLVTRA TNDAIWDWDL AADHVLWNEA LQAAYGWAPE
TVEPTGRWWL DHIHAEDRPR AEAGIRRVIG GGGHDWHHEY RFCRADGAYA DVLDRGSMVR
GADGRPLRMI GAMLDLTERN RVAAQLRAVV EGANIGIVQI DPRTMIALEA NPKLCAIWGA
EESDIVGHSI AKWTPEADAA ERDQLHRRLA AGEIVRETLE KRYRRKDGRL IWGRVNLVSQ
ARGEALQATA MIEDITAEKA TEARQTALIE LGDTLRDAAG PAEIRGIAAR ILRRSLDLSE
AGYAAIDATV GGFAIGRARP DGTMSPAPFP AMLARLRRGE ILAVPDLTAE PDLAPDAGGY
AAAGARALIG VPLTRRGVLV GLVYAHAAEP RTWDAGEVDF VREVAGRISV ALARIQAEEQ
QRFLNRELSH RLKNTLTMAQ AIASQTLRNA TDIASVKEAL VARLVALGKA HDILLSGEGE
GAALQAVIAG ALTIHDDGEP GRIHLSGPAL EVGPKAALSL ALMIHELATN AAKYGAFSVP
GGRVGVNWHV ARARWPEDAE DAGEAEPVIT MTWAETGGPP VAAPTRKGFG SRLIERGFSG
AVGGETQMIY AREGVTCRIR APLKGLLEKE