Gene Information |
Locus tag | Mext_3804 |
Symbol | |
ID | 5832210 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | - |
Start bp | 4223977 |
End bp | 4225773 |
Gene Length | 1797 bp |
Protein Length | 598 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 641369596 |
Product | EAL domain-containing protein |
Protein accession | YP_001641249 |
Protein GI | 163853206 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG2200] FOG: EAL domain |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0052374 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
Fosmid Coverage information |
Num covering fosmid clones | 42 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGTCTGGAA CGGATTGCCG CGAGCGTCTC GAAGAAGATC GTTTGAATGC GCTCCAGGCA CTCAACCTCC TCGACACGCC GCCGAGCGAG AGCTTCGACC GCATCACACG GATGGCCAGC CAGATCTTCA ACCTGCCAAT CTCGGCGGTG TCGTTGACGG ATCGCGACCG GCAATGGTTC AAGTCGCGCA TCGGTGTCGA TCACTGCTCC ATCCCACGGG ATAAGGCGCC GTGCGGGCAG GTCGCCGAAA GCGCCGAAGT CCTCGTGATC CCCGATTTCG CGCAGGACGC CTTCTACGCC GACAGCGTGC TCGGCCGCTC CGGCATCCGC TTCTATGCCG GCGCACCGCT GGTCACGCGC GAGGGCTACA GCCTGGGTGC CCTCTGTGTC CTTGGAACCG AGCCCCGCAC GGTCTCTCCG GCCGAGATCG CCGCCCTCAC GGACCTCGCC GCGATGGTCA TGGCGCAGAT CGAGTTGCAG CACGCTTTCG GCCGGGTCGA TCCGTTGAGT GGCCTGCCCA GCCGGAACCA GTTCCTGGAC GACCTCGCCG ATCTCGCCGC CGAGCATCCG GATGAGGCCA GGATCGCTGT GCTCATCGAT CTCGGCCGAC CCGAGCAGGT CAGTGCCTAT AGCCGCGTCA TGGGTCCGGG CCGCGTTGAC GATCTGGTGC GGGAGGCCGC GCGGGAGTTG CGGCGCCTCA TCGGCCCAGG GCGCAAACTC TACCACACGG CGGCCACGCA GTTCACCTTT CTCGCTCCAC GCGGGGCTCA GCAGGACGAC TATGTCCGCC TCCTCGCGGA CGAGCACCGG CAGGCGCGGC AACGCTCCAT CACTGGGATG CTGCTGACGA GCGCGATCGG TGTGAGCGTG TTCAAGCCTT GCACGACGGC GCCACAAGAC GCGCTGCGCT CCCTCTACAG CGCGGTTCAG GACGCGCGCT CGTTGCACGA TCTCATCAGC GTCTACTCGT CCGTTGCCGA CGAGGCCTAC CAGCGCCGGT ACCAACTGCT CCAGGATTTC GGGCCGGCCC TCGGTGCCGA CGACCAGCTA CGCCTCGTTT TCCAGCCACG CATCGACCTG TCCACCGGTC GATGCATCGG CGCGGAGGCG CTGCTGCGCT GGGACCATCC AGAGTTGGGG CCCGTGTCCC CCGGCGAGTT CGTGCCGGTG ATCGAACTCT CGCCCCACGC GCAGGCGATG ACGGCCTTCG TTCTCGAGAG GGCGCTGGCG CAGGCGCGCC GCTGGCAGGA TGCCGGGCAC AGCTTGGTGA TGTCGGTCAA CATCTCAGCC GCGAACTTGA TCGAAGCTGG CTTCGGCCAG TCGGTCGAGG CCGGCCTTCG GCGCCACGGC CTTGCGCCCG GGCAGTTGGA ACTGGAAGTG ACTGAGAGCG CGATCATGCA AAATGCTGAA CAGGCGCGAC GTCAGTTGGA CTTGCTGGCC GCGGCCGGCA TTCGCTTGGC GATTGACGAC TTTGGGACGG GCTACAGCAG CTTGGCCTAC CTGCAGGACA TCCCAGCGCA CGTCGTGAAG ATCGATCAAA GCTTCGTGCG CAAGCTTGCG GATGGCGAGC GGGAACGATC GCTCGTCCAC TCGATGATCC ACCTCTCGCA TGATCTCGGC TACCGGGTGG TCGCGGAGGG CATCGAGACG GCGGAGGCAG CCGACCAAGT CAGGGCGATG GCCTGTGATG AGGCGCAAGG CTATCTCTTC GCCCGCCCGC TGGAGATCGG GACATTTGAG ACGTGGCTCA GGGAGCATGA GCAGGACGCC CGGTACGAGC CGGCACTGGC CAGCTGA
Protein sequence | MSGTDCRERL EEDRLNALQA LNLLDTPPSE SFDRITRMAS QIFNLPISAV SLTDRDRQWF KSRIGVDHCS IPRDKAPCGQ VAESAEVLVI PDFAQDAFYA DSVLGRSGIR FYAGAPLVTR EGYSLGALCV LGTEPRTVSP AEIAALTDLA AMVMAQIELQ HAFGRVDPLS GLPSRNQFLD DLADLAAEHP DEARIAVLID LGRPEQVSAY SRVMGPGRVD DLVREAAREL RRLIGPGRKL YHTAATQFTF LAPRGAQQDD YVRLLADEHR QARQRSITGM LLTSAIGVSV FKPCTTAPQD ALRSLYSAVQ DARSLHDLIS VYSSVADEAY QRRYQLLQDF GPALGADDQL RLVFQPRIDL STGRCIGAEA LLRWDHPELG PVSPGEFVPV IELSPHAQAM TAFVLERALA QARRWQDAGH SLVMSVNISA ANLIEAGFGQ SVEAGLRRHG LAPGQLELEV TESAIMQNAE QARRQLDLLA AAGIRLAIDD FGTGYSSLAY LQDIPAHVVK IDQSFVRKLA DGERERSLVH SMIHLSHDLG YRVVAEGIET AEAADQVRAM ACDEAQGYLF ARPLEIGTFE TWLREHEQDA RYEPALAS
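
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the database record. It assumes that Start bp and End bp are 1-based, inclusive coordinates and that the gene length includes a single terminal stop codon (the gene sequence above ends in TGA). The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Start bp and End bp as listed for Mext_3804.
    length = gene_length_bp(4223977, 4225773)
    print(length, protein_length_aa(length))  # prints "1797 598"
```

Under these assumptions the computed values agree with the Gene Length (1797 bp) and Protein Length (598 aa) fields listed in the record.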