Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mext_2103 |
Symbol | |
ID | 5833210 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | + |
Start bp | 2359837 |
End bp | 2361804 |
Gene Length | 1968 bp |
Protein Length | 655 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641367900 |
Product | PAS sensor protein |
Protein accession | YP_001639569 |
Protein GI | 163851526 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG3920] Signal transduction histidine kinase |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.00365518 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGATGGGA GCGACAGTCC ACCTGCGCAG TCGCGCCGCG GCGAGATGGC CGAGCGGATC CGGGCGCATG ACTGGACCGC GACGGCGCTC GGTGCGGCGG ACGCTTGGCC GCCGAGCCTC AGGGCCACGA TCAGCCTGAT CCTCGGCTGC GGTTTCCCCA TGATCGCGCT GTGGGGCCGG GATCTGATCC AGGTCTACAA CGACGACTTC CGGGACCTGA TGGGATCGAA GCACCCGGCC GGGCTGGGTC AGCCGGCACG CGCGTGCTGG CCCGAGATCT GGCACATCAC CGCGCCGATC TACGAGCGGG TCTGGAACGG CGAGACGTTC ACCTTCGAAG ACGCGCTCTA CCCGCTGTTC CGGTCGGGCC GGCTCGAAGA CGCGTGGTTC ACGCTGACCT ACAGCCCGCT GCGGGACGAG GCGGAGCGGA TTGCGGGCAT TCTGGTCACC CTGGTCGAGA GTACGGCCCG CGTGCTGTCC GACCGGGCGT TGCGCGAGAG CGAGGCGCGC TTCCGGGCCG AGCTGGAACG TCAGGTCCAG GAGCGGACGG CGGAGCTTCA GGCGAGCCGA GACCTGCTCA AGGCGACCAT GGACAGTTCC ATGGACATGA TTCAGGTCTT GAAGGCCGTG CGCGATCCGG CGGGCGAGAT CATCGATTTC CGCTGGCTCC TGAACAACTC CACCTCGGCG AGCCGCTACG GCGATGTGGG CGGTCAGAGT TTGCTCGAAC GCAATCCGGG CGTGATTCAG GAGGGCATCT TCGACACCTT CAAGCGTGTC ACGGAAACAG GCCAGCCCGC GACCGCGGAG CGCCGCTACG CCCACGAGCA GTTCGACGGC TGGTTCTTCC AGTGCGCGGT GAAGCTTGGT GACGGAGTCG CTATCACCAC CAAGGAAATT TCGGCGTGGA AGGCGGCGCA GAACGAGATG CTGCGGCTTC GCGACGAGAG CGCAAACGCG GCCCTGCGCG AGAGCGAGGA ACGCTTTCGC ACCCTGGCGA GCCTCATCCC CGTCCTGCTG TGGCGCTCGG ACGAGAGCGG GCAGCACAAC TCCCTCAACG AGGCCTGGCT CACCTATACC GGCCAGACCC TGCAGCAATC CCAGGCCGGC GGCTGGCTCG AAGCGATCCA TCCCGCCGAC CGCGACGCGG TGAGCGAGGC CTTCCGCTCC GGACGCGAGC AGCAGCGGTT GATCGAGGTG CAGCAGCGCA TCCGCCGGTA CGACGGGCAG TATCGCTGGT TCCTCGTACG GCAGGCGCCC CTCCTCGACA CCGAGGGGCA GGTCACGCAG TGGATCGGTG CCGCCATGGA CATCCACGAT CTGCACGATC TGCAGGAGCG CCAGACCATC CTCGTCGCCG AGTTGCAGCA CCGCACCCGC AACCTGCTCG GCGTCGTGCG CTCCATCGCC CACCAGACCA TGGCGCAGAC CGGTCCGACG GAGCGCTTCC GCGAGCAGTT CAACGACCGG CTCGCCGCCT TGTCGCGGGT TCAGGGGCTG CTGTCGCGCT CGGAGCAGGA GCCGATCACC CTGCGCACCC TGATTCGAAC GGAGCTGGAC GCCCTCGGGG GCGGCGACTT CGCCGATCGA ATCCATATCG CCGGGCCGCC GGTGCGCCTG CGCAAGGCGT CGGTGCAGAC CCTGGCGCTC GCCGTGCACG AACTGGCCAC CAATGCCCGC AAGTACGGTG CCCTGACGAC CGAGCACGGC CGCCTCTCGG TGACATGGCG CGCCGACCGG GACGACCAGG GCGGAGGAAA CCTGCTGATC GAGTGGATCG AGGAGGGCAT CAGCCGGCCG CGCGAGGAAC AGAGCCCGAC GCGGCGCGGC TACGGACGCG AGTTGATCGA GCAGGCGATG CCCTACGCGC TCAACGCCAA GACGCACTAC GAACTCGGTG AGACGCGGCT GCGCTGCGCC ATCGAACTGC CGCTGGGCGA GCGGTTCGGG CAGGTGAGCA CGGCCTGA
|
Protein sequence | MDGSDSPPAQ SRRGEMAERI RAHDWTATAL GAADAWPPSL RATISLILGC GFPMIALWGR DLIQVYNDDF RDLMGSKHPA GLGQPARACW PEIWHITAPI YERVWNGETF TFEDALYPLF RSGRLEDAWF TLTYSPLRDE AERIAGILVT LVESTARVLS DRALRESEAR FRAELERQVQ ERTAELQASR DLLKATMDSS MDMIQVLKAV RDPAGEIIDF RWLLNNSTSA SRYGDVGGQS LLERNPGVIQ EGIFDTFKRV TETGQPATAE RRYAHEQFDG WFFQCAVKLG DGVAITTKEI SAWKAAQNEM LRLRDESANA ALRESEERFR TLASLIPVLL WRSDESGQHN SLNEAWLTYT GQTLQQSQAG GWLEAIHPAD RDAVSEAFRS GREQQRLIEV QQRIRRYDGQ YRWFLVRQAP LLDTEGQVTQ WIGAAMDIHD LHDLQERQTI LVAELQHRTR NLLGVVRSIA HQTMAQTGPT ERFREQFNDR LAALSRVQGL LSRSEQEPIT LRTLIRTELD ALGGGDFADR IHIAGPPVRL RKASVQTLAL AVHELATNAR KYGALTTEHG RLSVTWRADR DDQGGGNLLI EWIEEGISRP REEQSPTRRG YGRELIEQAM PYALNAKTHY ELGETRLRCA IELPLGERFG QVSTA
|
| |