Gene Information |
Locus tag | Mlg_1052 |
Symbol | |
ID | 4270525 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | + |
Start bp | 1224961 |
End bp | 1226001 |
Gene Length | 1041 bp |
Protein Length | 346 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 638125804 |
Product | diguanylate cyclase |
Protein accession | YP_741895 |
Protein GI | 114320212 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG2199] FOG: GGDEF domain |
TIGRFAM ID | [TIGR00254] diguanylate cyclase (GGDEF) domain |
|
|
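The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based, inclusive coordinates and a coding span that ends with a single stop codon; both are assumptions that happen to reproduce the values in this record, not conventions stated by the record itself. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(1224961, 1226001)
print(length)                     # 1041
print(protein_length_aa(length))  # 346
```

Both results match the Gene Length (1041 bp) and Protein Length (346 aa) rows.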
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 0.169813 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAGCAGG CAGACCGGAA TCAACTGAAC GAGGCGGTAA ACCAAGCCTT TTTGCGCTAT CTGGACACCT GGCTGAGGAA GCGTGATGCC GCTGCGTTGC ATGACCTCAA CGGGCCGCTG GTGCATGGCT TCGGGACCGG TGCCAACGAG TCGGTCTATG ACCCGGACGA GGCCCGCGAG GTCATTGAAC GGGATATCAG CCAGGTGCCT GAGCCGTTTG AGTACGCCGT TCGTACGGCC AAGGTGACGC CGCTGTGCAC GGACGTGGCG CTGGTGGCGG CGGATATCGG GATTCGCAAC ACTATCCTGG CGCAAGGGCA GGAGCTGGCC CTGGACCATC TGCGCCTTAC CCTGGTCTTC CGATACATTG ACGGGGCCTG GCGACTGGAG CACCTGCACG GCTCCTTTCC CGCCACCGAG CCTGATGGAG AAGAGGCCTG GCCGTTGCAA GCGCTTGAGG ACCGTGCGGC GGTGCTTCAG CGCAAGGTGT GGGAGCGTTA TCGCGCGCAG GAGGCAGCCC GGCAATGCGA GGAGGGGCAG GCCAGCACCG ATGCGCTTAC CGGCCTTCCC AATCGCGAGA GGATGGATGA GCTGCTGCAC CGGGAGGTGG CGGGCCTGAA CGGGCAGTCC GGGGCCCTGG CGGTGATCCT GATTGACATC GACCATCTCA ATCTGGTCAA CGAGGGCCTT GGCCAGGAAG CGGGCGATCG GGTGTTGGCC GACGTCGCCA ATATCTTGCG CGATCGGATT CGCGTCACCG ATGCGTTGGC GCGGTGGGGC GGGGGGGCAT TCCTGTTGGC CTGCCCGATG ACGGACGGCC TGGAGGCCGA ATACCTGGCG CGGGCCCTGC GACGGGCCGT GGCGGAGACG GATTTCGGGT TGGGTTTCCC GTTGACGGCG AGTTTTGGGG TTACCGCCCT GCAAGCGGGG GATACCGTGC CGGGGCTGAT TCGGCGGGCT GAGCGTGGCC TGCGCCAGGC CAAAGAGGCG GGGCGGGATA CGGTGCAGGT GGTTTGTCGG GAGGAGATCG TGCCGGGCTA G
|
Protein sequence | MEQADRNQLN EAVNQAFLRY LDTWLRKRDA AALHDLNGPL VHGFGTGANE SVYDPDEARE VIERDISQVP EPFEYAVRTA KVTPLCTDVA LVAADIGIRN TILAQGQELA LDHLRLTLVF RYIDGAWRLE HLHGSFPATE PDGEEAWPLQ ALEDRAAVLQ RKVWERYRAQ EAARQCEEGQ ASTDALTGLP NRERMDELLH REVAGLNGQS GALAVILIDI DHLNLVNEGL GQEAGDRVLA DVANILRDRI RVTDALARWG GGAFLLACPM TDGLEAEYLA RALRRAVAET DFGLGFPLTA SFGVTALQAG DTVPGLIRRA ERGLRQAKEA GRDTVQVVCR EEIVPG
|
| |