Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | MCA0985 |
Symbol | |
ID | 3102462 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylococcus capsulatus str. Bath |
Kingdom | Bacteria |
Replicon accession | NC_002977 |
Strand | + |
Start bp | 1030441 |
End bp | 1033371 |
Gene Length | 2931 bp |
Protein Length | 976 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637170173 |
Product | sensory box protein |
Protein accession | YP_113464 |
Protein GI | 53804907 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG5001] Predicted signal transduction protein containing a membrane domain, an EAL and a GGDEF domain |
TIGRFAM ID | [TIGR00229] PAS domain S-box [TIGR00254] diguanylate cyclase (GGDEF) domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.928504 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTCCGGG CGATGATCGA ATGCCTCCAG GATGAGGTGG TGACGGTCCA CGACATCGAA GACTGCGCCC GCATCATCTA CGCGAACGAA GCGGCCTGTC GCCATTTCGG CGTGGGTCTC GAAACCCTGT TGACCTGGTG CCCGCTGGAT TTCGATCCCA CATACGACGA AGAATCCCTG CGTCTAGCCC TCGAACGGCA CCGTCAGACC GGGGGCACGA AGTTCGAGAC GCTGCACCGC GTGGCGGCGG GGATGGAGAT TCCTGTCGAG GTGACGGCGA ACCTGTTCGA GCAGGACGGC AGGTTACTGG CGGTGTGCAT CAGCCGCGAC ATTCGCCCCC GCCGTGAAGC CGAGGCGCGC ATGAAGGAGC TCGAGAGGCT GCAGGCCGAG CGGGAAGGGC TGGACCGTCT CGCCCGCTTC GCCCAGTGTG CGCCGGGATT CATGTATACC GTGGAGGCGC GCGCCGACGG ACGTCTCGTG ATGACGTACG CCAGCTCCGG CGTGGAGGAC ATCCTGGGAC TGAGCGTGGA AGCGGTCTTG GCAGACATCG GCCATGTTCA TTCGCGCATC GTTCCCGAAG ACCGTGACCG GGTCATGCGC TGCAAAGAGG CCTCGGCTCG TGAGCTGCAG CCGTTCACGG AGGAATACCG TATCCTTCAC CCCGTCAAGG GCGAGCGGTG GCTGGAGGCC CGCTCCGTGC CGGAGCGGCA GAAAGACGGC ATCACCCTGT GGCACGGTTT CCTGATCGAC ATCACCGAGC GGAAGCGGAT GGAGGAAACC TTGCAGTTCA TCGCACAGCG CAGCTGGGCC ACGGAAGGCG AGGCATTCTT TTCCGCCATT GCGCATCACC TCGGCAGGAT TCTCGGGGTC GATTACGTCG TCATCGACCG GCTAGCCAGG GACGCCGAAC TCGCCGAGAC CGTCGCGCTG TATGCCCGCG GCGAAGACTT GCCCAACATC TGCTACAGAC TCGCCGGAAC GCCCTGCGAA AACGTCATGG GCCGCAGCCT ATGCTGTTTC CCCCGCCAAG TACGCCGGCT GTTTCCTGAC GACCTCATGC TGCAGGACAT GGGGGTGGAA AGCTACGTCG GCGTGCCGCT GTGGGATTCG GCCGGAACCG CGATCGGCCT GATCGCGGTG ATGCACGGCG AGCCGATGGC GGAGCCGGAG TCCGTGACCG CCCTCCTGCA GCTCGTGGCG ACCCGGGTCG CGGCGGCTCT GGAACGCGAC CGCTCGGAGC GGCAGCTCAA GGCGCGTGAA CAGGAGTTCC GTTCCCTGGT CGAGCACTCG CCCGACACCG TCGCCCGCTA TGACCGGGAC TGCCGCCGTA TCTACGCCAA TCCCCGGCTG GTGGAAGAAG CCGGCGTGCC TCTCTCGGAA CTCTTGGGGA AAACGCCGGT CGAGTTTCCG GGCGGCGAAA GTTCCCGTGC CTACCAGGCG AAGATTCGGC AGGTTTTCGA AACGGGGGAG CCGGCGGAGT TCGAGCTGAG CTGGAAAACG GCGCAGGAAC GGGAGCTGAT CTCCCATATC CGGCTGACGC CGGAATTCGG TTTCGACGGC GAAGTCATGC ATGCGCTGGC GATCGGGCGG GACATCACCG AAATCGACGC CTATCGGCGG CGTATCCATC ACCTCGCATT TTTCGATTCC TTGACCGGAC TGCCGAATCG GGAAATGCTC AACAAGCGTA TCCGGGAGGT CATGGATGGC AATCCCCGTC CGGGCCGGCA GTTCGCCCTG ATGATGCTGG ACCTCGACCG CTTCAAGGAA ATCAACGACA CCCTGGGACA CGGGATCGGC GATCTGCTGC TGGGAAGAGC CGCCCGCCGT TTGCTGGGGG CGGTCGGCAA GGACGATACG GTGGCGCGGC TGGGCGGCGA TGAATTCGCC GTTCTGGTGC CGGCGCCGGC CGCTTCCCAA GAACTCACGG CGCTGGCTGG CCGGATACTC GACGCCTTTG CCCGGCCTTT CCTGATCGAG GGCCGCGAGC TGTTCGTTTC GGTCAGTCTC GGTATCGCGC TTTACCCCCG GGACTGCACC GGCATCGATA CCTTGTTCCG TTATGCCGAC ACCGCCATGT ATCACGCCAA GCGGCAGGGG AGGAACAACT TTCAGTTCTA TTCGGCCGAG CTCACGGCTC GGGCGGCGGA GCGCATGCGG ATCGAATCCG CCCTGCGCCG GGCGCTGGGG CGCAACGAAC TGGAGCTGCA CTTCCAATCG CAGATCGACA TGGTTTCCGG AACCATCGTC GGCGCCGAGG CCCTGCTGCG ATGGAACAGG CCGGGACGCG GCATCATACT GCCGCGTAAA TTCATCCCGA TCGCGGAGGA AACCGGTCTG ATCGTCGGCA TCGGCGACTG GATCCTGGCC CAGGCCTGCC AGGCCGTCGT TGCCTGGAAC CGCCACCGCG AGCAGCCGCT GCGCGTAGCC GTCAACCTTT CCACTCGCCA GTTCATCCTG AACGACTTGG CCGGTACGGT GCAGCGCATT CTGGACGAGA CCGGCTGCCG GCCGGAGTGG CTGGAGCTGG AAATCACCGA AAGCCTGCTG CTGGAAGACA GCAGGGGAGT CCGCGCGACC CTCGATGCCT TCGACCGCAT GGGGCTGTCG ATCGCCATCG ATGACTTCGG TACCGGCTAT TCCGCCTTGA GCTATCTGCA TCGCTTTCCG GTGAAACGGA TCAAGATCGA TCGCTGTTTC GTCCACGGCA TTCCGTCCGA CCGCTGCAAA TCGGAGCTGG TCAAAGCCAT AATCTCGATT GCCCAGGCCC TGGGCCTGGA GGTGCTGGCG GAAGGCGTGG AAACCCCACG ACAGGCCGTC TATCTCCAGG CGCATGGCTG CCGTCTGGGC CAGGGCTATC TGTTCGGTGA GCCGCAACCG CTTGTCGGTT TCGAGGCATC GATCGGACAG GACCGGAGCG CCATTTCCTG A
|
Protein sequence | MFRAMIECLQ DEVVTVHDIE DCARIIYANE AACRHFGVGL ETLLTWCPLD FDPTYDEESL RLALERHRQT GGTKFETLHR VAAGMEIPVE VTANLFEQDG RLLAVCISRD IRPRREAEAR MKELERLQAE REGLDRLARF AQCAPGFMYT VEARADGRLV MTYASSGVED ILGLSVEAVL ADIGHVHSRI VPEDRDRVMR CKEASARELQ PFTEEYRILH PVKGERWLEA RSVPERQKDG ITLWHGFLID ITERKRMEET LQFIAQRSWA TEGEAFFSAI AHHLGRILGV DYVVIDRLAR DAELAETVAL YARGEDLPNI CYRLAGTPCE NVMGRSLCCF PRQVRRLFPD DLMLQDMGVE SYVGVPLWDS AGTAIGLIAV MHGEPMAEPE SVTALLQLVA TRVAAALERD RSERQLKARE QEFRSLVEHS PDTVARYDRD CRRIYANPRL VEEAGVPLSE LLGKTPVEFP GGESSRAYQA KIRQVFETGE PAEFELSWKT AQERELISHI RLTPEFGFDG EVMHALAIGR DITEIDAYRR RIHHLAFFDS LTGLPNREML NKRIREVMDG NPRPGRQFAL MMLDLDRFKE INDTLGHGIG DLLLGRAARR LLGAVGKDDT VARLGGDEFA VLVPAPAASQ ELTALAGRIL DAFARPFLIE GRELFVSVSL GIALYPRDCT GIDTLFRYAD TAMYHAKRQG RNNFQFYSAE LTARAAERMR IESALRRALG RNELELHFQS QIDMVSGTIV GAEALLRWNR PGRGIILPRK FIPIAEETGL IVGIGDWILA QACQAVVAWN RHREQPLRVA VNLSTRQFIL NDLAGTVQRI LDETGCRPEW LELEITESLL LEDSRGVRAT LDAFDRMGLS IAIDDFGTGY SALSYLHRFP VKRIKIDRCF VHGIPSDRCK SELVKAIISI AQALGLEVLA EGVETPRQAV YLQAHGCRLG QGYLFGEPQP LVGFEASIGQ DRSAIS
|
| |