Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | MCA1844 |
Symbol | |
ID | 3104644 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylococcus capsulatus str. Bath |
Kingdom | Bacteria |
Replicon accession | NC_002977 |
Strand | - |
Start bp | 1974142 |
End bp | 1976823 |
Gene Length | 2682 bp |
Protein Length | 893 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637171003 |
Product | hypothetical protein |
Protein accession | YP_114281 |
Protein GI | 53803874 |
COG category | [S] Function unknown |
COG ID | [COG4458] Uncharacterized protein conserved in bacteria, putative virulence factor |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCACATGA CGACTTGGGA TGACAAGACG CTGAGCGAGC GTTGCGAGGC GATCTTCCGG GGTGCGGGGG ATGCGCTGGA ATGGGTGGCG GAGGTGCGCA CCAACGCCCA GCGGCTGGAC CGGGAGGGCG ACGGTCTGAT CGAGAAGCTC CGGCGTTCGC GCAACCAGTG CCGGCGCCTG GGCGCTGCGG CCAAGCGTCC GTTTTCCGCC GGGGTGTTCG GCATGTCGCA GGCCGGCAAG TCGTACCTGA TTTCCACCCT GGCGCGTTCC GCCGAGGGCG ATCTGCAGAC GCTGCTGGAC GGCAGGCGGG TCGATTTCAT CGGCCATCTG AATCCGCCCG GCGGCGGCAA GGAGGCGACG GGGCTGGTGA CGCGCTTCAC CCGCGAGCCC AGCGCCGCGC CGCAGGGATT CCCGGTCGAG CTGACCCTGT TTTCCGAGGC CGATATCGCC AAGATCCTGG GCAACAGCTT CTTCAACGAC TTCGACCGGG AGCGGGTGAG CTTCGACACC GATCCGGAAA AACTCCGCGC CCTGCTGGGG CGGCTGGAAC CCTTGGCCCA GCCCGAGCCG ACCGGGGGCC TGAGCGCCGA CGAGATCATC GATCTCATCG ATTACTTCGA GCGCCGCTTC GAGAAATCCT TCGCGCCCTT GCGCGCCGAC TACTGGCCCA GGGTGATCGA ACTCGCGCCG CGGCTGCGGA CCGAGCACCG GGCGGAACTG CTGTCGGTGC TGTGGGGCGG AATTCCCGAC CTGACCCGCG CCTATCGCCT GCTGGCCGAG GCCCTGGCGC GGATGTCGCA CGCCGGCACC GTGTTCGTGC CGCTGGCCGC GCTGGTGTCC GAGACCGGGG GCGAGCCGGC CTGGCGGGCG GACAGCATCC TCAACGTGGA TGTGCTGGAC GGACTCGGCA AGGAGGGCGG CGAAAGCCTG AAGGTCCTGC CCCAGCGTGA GGGCCAGGTC TTGGCCGAGG CAGAGGTGTC GCGTCCGGTG CTGGCGGCGC TGACGGCGGA GCTGCGCTTC GTGCTGGCCG ACGCCCCGGC GATCGGCCTG CTGGAAGAGG TCGATCTGCT GGACTTCCCC GGCTACCGCG GCCGGCTGGA CGTCGCAGAC CTGGAGGAGG TGCGCAAGCG GCTCAAGCGC GAGGACGCCG ACCCGGCGGC GCAGCTCATG CTGCGGGCCA AGGTCGCCTA TCTGTTCGAG CGCTACACCG AAGACCAGGA GATGAACGTG CTCCTGATGT GCACCCGCTG CGACAGCCAG ATCGAGGTCA CCTCGCTGGC GCCGGCCCTG TCCGCCTGGG TCCACGCTAC GCAGGGCGAG ACGCCGGCCG ACCGCGGCGC CCGTCCGCCC GGGCTGGTCT GGGTGGTCAC CCAGCTCGAC CGCCGGCTGG AAGCCAAGCC GGGCCAGACC GCCGCCCAGC AGCAACAGGA ATGGGCCAAC ATGGTCCACA TCACACTGCT GGAGCGTTTC TCCCAGTGCG ACTGGCTGCA CGAATGGAGC GAGGGCAAGC CGTTCGACAA CGTCTTCATG GTGCGCAAGC CCGGCATGCT GCGCAGCGCG TTCAAGGTCG ATGCCGATGG GGTGGAAACC GATTTCCTCT CGGAGGAGGA GCGCCAAAGG CTGGCCGGGC AGCGCGAGTT TTTCGTGGGT AACGAATCGG TGCAGCGCCA TGTCCGCGAT GCCGGCGTGG CCTGGGATGC GGTGCTGGGC GTCAATGACG GCGGCATGAC CCGGCTGGCC GAATATCTGC GCACCGTTTG TCTGCGCGAA ACGAAATGGA ACCGGATCGG CGAGCAGCTG GTCAAGATCC AGCAGGAAAT CGGCGAACAC CGGCTGCTGC CGTATTTCCA GGCCGAGGGT GCGAGTGAGG TCGAAAGGAA GAAGCGCTGT GCCGAACGGT TCTATCAGGC GGTGGTCGAG TCCCCGGACG GCTTCGGCGA ATTGCTCCAT CGCCTGCATC CCTCTGCGGA GCAGCTGCGG CGGCTGTACC TCACGGCAGA CGACGCCGGC GGGAAGGGCG AAACCGAGAC CGCCAGGCCC GCACCGGCAC GCCGCGGCCT CATCAACCTG CCCGTGGCGA AAACCGCCGG GCCGGTGGTG GAGCGTGGCG GGCGCGCAGA GAATTTCGCC AGAGCGGTGA TATCGGCCTG GATACTTCAG CTCCGGGCTT TGCCGGAGCA GGCCGACTTG CTACGCTATT TGGGGCTGGG CGAGGAGGCA GTGCGTATCG TCAGTGACGA GCTGGTCACC GGCGGTGACC GGCTGCAACT CGAGCGGAAA CTGGTGGAAG CGTTGCGTCC GCTGGAAGAG ATGCGGGGTA CCACCCGCAT CGGCATCGTC GATCAGCAGG TGATGGTGGT GCGGCGGGTC ATCGGCGAAT TCGTCGACCT GTTGGGATGG GCCGAAGTAC CCTTGGGTGC CCGGCCGGAA TCGCCGATGG GTGGGCGCCG ACTGTTCGAA CCGCCGGCGG CGATCGCGGC GGCCGCCTTG CCTCGGCTCA GCGTGGAAGA GCTGAATTAT CCGGCAGCCT TCATCGTCGA TTGGTTGGAA GCCTTCCGCA GACTGGCGTT GGACAATGCC GGGCATAGCG CCGGCCGCGA GATCACGCCG GAACAGAATC TGCGTCTGGG CGAAATCCTG GCGACCCTCG GCGTCGTAGC CGGGCATCGA GAGAGACGGT GA
|
Protein sequence | MHMTTWDDKT LSERCEAIFR GAGDALEWVA EVRTNAQRLD REGDGLIEKL RRSRNQCRRL GAAAKRPFSA GVFGMSQAGK SYLISTLARS AEGDLQTLLD GRRVDFIGHL NPPGGGKEAT GLVTRFTREP SAAPQGFPVE LTLFSEADIA KILGNSFFND FDRERVSFDT DPEKLRALLG RLEPLAQPEP TGGLSADEII DLIDYFERRF EKSFAPLRAD YWPRVIELAP RLRTEHRAEL LSVLWGGIPD LTRAYRLLAE ALARMSHAGT VFVPLAALVS ETGGEPAWRA DSILNVDVLD GLGKEGGESL KVLPQREGQV LAEAEVSRPV LAALTAELRF VLADAPAIGL LEEVDLLDFP GYRGRLDVAD LEEVRKRLKR EDADPAAQLM LRAKVAYLFE RYTEDQEMNV LLMCTRCDSQ IEVTSLAPAL SAWVHATQGE TPADRGARPP GLVWVVTQLD RRLEAKPGQT AAQQQQEWAN MVHITLLERF SQCDWLHEWS EGKPFDNVFM VRKPGMLRSA FKVDADGVET DFLSEEERQR LAGQREFFVG NESVQRHVRD AGVAWDAVLG VNDGGMTRLA EYLRTVCLRE TKWNRIGEQL VKIQQEIGEH RLLPYFQAEG ASEVERKKRC AERFYQAVVE SPDGFGELLH RLHPSAEQLR RLYLTADDAG GKGETETARP APARRGLINL PVAKTAGPVV ERGGRAENFA RAVISAWILQ LRALPEQADL LRYLGLGEEA VRIVSDELVT GGDRLQLERK LVEALRPLEE MRGTTRIGIV DQQVMVVRRV IGEFVDLLGW AEVPLGARPE SPMGGRRLFE PPAAIAAAAL PRLSVEELNY PAAFIVDWLE AFRRLALDNA GHSAGREITP EQNLRLGEIL ATLGVVAGHR ERR
|
| |