Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlg_1109 |
Symbol | |
ID | 4269816 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | + |
Start bp | 1295509 |
End bp | 1298337 |
Gene Length | 2829 bp |
Protein Length | 942 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 638125861 |
Product | diguanylate cyclase/phosphodiesterase with PAS/PAC sensor(s) |
Protein accession | YP_741951 |
Protein GI | 114320268 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG5001] Predicted signal transduction protein containing a membrane domain, an EAL and a GGDEF domain |
TIGRFAM ID | [TIGR00229] PAS domain S-box [TIGR00254] diguanylate cyclase (GGDEF) domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.339045 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 38 |
Fosmid unclonability p-value | 0.881736 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCATGA GCGAAACCGG AACCGCGCGA CGCCGACTCC GCCTGGGCCT GACCTGGCGG GTCATCGCCC TGACCAGTCT GGTGCTGGTG GGGCTGGCAG CCCTGGTCAC CGCGCATGGG CATGGCACCC TGGAGCGGCA GTTTCAGGAG GCCCGTGAGG CGCACCAGGC GCGCCAGCAT CGCGAGATCC GCCTGGCGCT GGAGCGATCG GCGGAGGGAC TGCGCGAGCT GGCGGCACTG GTGGCGTCGG CACCGGAGCT GGCCCGGGCC GTGCAGGCCG GTGATTCGGA GGGTGCGCAC CGCGCTCTCG ACGGGCAGTG GCCCACTCTG CAGTTGGAGG CCGGGGTGGA GCGTATCGGC GTCTACGCGC CAGACGGCAC GCAGGTGGCC GCCCGCGGCG GGAGGCCCGC GGTCGCGCCG GAGGGGATCG AGCGCTGGCT GCATCAGGTC CGTCTGGACG ACAGGCCACT GACCGCATTG ATCTGCCAGG CGGGCTGTCG TCAGTTCAGC GTGGTGCCCA TGCTGGTGGC GGGTGAGACC GCCGGTATGG TGATGCTGTC GCGCTCCCTG GTGGACCTCA CCCGCCAGCT GCACGATGTC TCCGGCAGTG ATGTGGCGCT GCTGCTACGC GGCGACACCA TGGTGGATCA GGCAGGGGAC ATCCCCCGGT TGCCGGAGTG GGACGCTCAT CTGCTCGCCC TCACCCGCAG TGAGGTGACC CTGCCCTGGC TGCAGGCCCT GGCCAGCGAG GTTAGCATGG AGGCCCTGGT AAGCGCCCCG CGGCAGATCG CGATCGACGG ACGGGAGTTG GAGATCAACG CGGTGACGCT GGCCGAGCAT GAGGCGCCCG GTGGCAGCGG CTACCTGCTG CTGATTACCG ATATCACCCG CCAGCTGGAT GTCATCCAGT GGCACACCCG CAACCTCTTC TTTATGGGGC TGGCCGGATG GCTCGTGGCG GAGGCGATCC TGCTGCTCAT CCTGTGGCGT CCGATGCTGC GTCTGCGCCG CCTGGCGCAG GTCCTGCCGG GGCTGGCCGA GGGCGGCTTT CAGCGGGCGC GTACCGCCAT TCCGGTCGGG CGCCCCCGGC TGGCCGATGA AATCGATCTG CTGGAGACCA CCACGCTCGG TCTGGCCGAT CAGCTGGAGG CCCTCGAGCG CGAGGTGCGG CGCCGGCGTG AGCAGGTGGT CTCCCAGCTG CGCGCCCTGG GGCGTGAGCG GGATTTCGTC AGCAGTCTGC TGGACACCGC CCGGGTCCTT ATCGTGAGTC AGGACGCCGA GGGGCGGATC ACGCTAATCA ACGACTACGC CCAGGCGGCG TTGGGCCGTC GCGAGGATGA GTTGGTGGGC GCGCACTTTG ACGCGGTCTT TCCGGGCCTG GTCCCCATCG GTCGTAACAG CGGCCTGCCT CGGGAGGAGG AGCGCCCGCT GCACAGCCCC GGCCGGGGGG AGCGGATCGT GGCCTGGTAC CACGCCCCGC TGGCGGCGGA GGAGGGGCGG CCGGCGGGCC GCATCTCGGT GGGGCTGGAT ATCACCGAGC GCAAGGCCGC GGAGGCGCGG CTGATCTGGC TGGCCCAGCG TGATCCGCTG ACCGAGCTCT ACAACCGCCG TTACTTTCAG GAGGCCCTCG ACAGGGCGTT GGCCAAGGGG GTGCACGGGG CGGTGCTGCT GATGGACCTG GACCAGTTCC GGGATGTCAA CGAGCTGAGT GGCCACCACG CGGGTGATGA GCTGTTGCGG TTGGTGGCCG GGACGTTGCT GGATCATCTG GAGCACCGGG GCGTGATCGC CCGCCTGGGC GGCGACGAGT TTGCACTGCT GCTGGAGGAG ACGGATGCCG ACGCGGCGGT CTCCGTGGCC CAGTACATGG TCAAGCTCCT GGAAGACCTG GGCCTGAGTA TCGGCGAGCG CCGCCACCGG GTCAGCGCCA GCATTGGCAT TGTGCTGTTC CCCGAACATG GCGAAACCCC GACCGATCTG ATGGCCAGTG CCGATGTGGC CATGTACAAG GCGAAAGAGA CAGGGGTCCA GCGCTGGCAC CTGCTCCACG CCCTGGACCA CGCCAAGGGC GAGCTGCAGG AGCGGGTCTA TTGGGTGGAG CAGTTACACC AGGCGCTCCA GGGTGATGCC TTCGAGCTCA TGGTGCAGCC CATCGTGCGG CTGCGGGACC GCAGTGTGCG CCACTACGAG GTGCTTGTGC GTATGCGCGA TCCGTCCGGT GAACTGCTGC TGCCCGGCCG GTTCATCCCC TTCGCCGAGC ACAGTGGCCA GATCGTTCAG CTGGACCGCT GGGTGCTGCG CGCGGCCCTG AAGGTGCTCC GCCGGGTGCA GTCACAGGGT ATTGGCCTGG CAGTGAACCT GTCGGGGCAG TCGCTCCACG ACGATGGACT GACGACCTTC CTGGCGGACG AGCTCCGCGC CAGTGGTGCG GACCCGGAGC ACCTGATACT GGAGATCACC GAGACCGCGG CGGTTACCGA TTTTTCCACC GCCCGAGGGG TGTTGGAGGG CATCCGCGCC TTGGGTTGTC AGACGGCATT GGACGATTTC GGGGTCGGGT TCAGCAGCTT CCATTACCTG GGCCAGTTGC CGGTGGACTA TATCAAGATC GACGGTAGCT TTATCCGCAG CCTGCCCCAC AACGAGGACA GCCGGATTAT CGTCAGGGCC ATCGCCGACA TTGCGGCCGG TTTCGGCAAG GCGGCCATTG CCGAGTTCGT TGACCAGGAG GTCCTGGTGC CGATGCTGCG TGACTACGGC ATCGCTTACG GCCAGGGCTA TCACCTGGGC CGGCCGGTGC CAGTGGAGGA GGCCTTCGGG CCAGCCTGA
|
Protein sequence | MRMSETGTAR RRLRLGLTWR VIALTSLVLV GLAALVTAHG HGTLERQFQE AREAHQARQH REIRLALERS AEGLRELAAL VASAPELARA VQAGDSEGAH RALDGQWPTL QLEAGVERIG VYAPDGTQVA ARGGRPAVAP EGIERWLHQV RLDDRPLTAL ICQAGCRQFS VVPMLVAGET AGMVMLSRSL VDLTRQLHDV SGSDVALLLR GDTMVDQAGD IPRLPEWDAH LLALTRSEVT LPWLQALASE VSMEALVSAP RQIAIDGREL EINAVTLAEH EAPGGSGYLL LITDITRQLD VIQWHTRNLF FMGLAGWLVA EAILLLILWR PMLRLRRLAQ VLPGLAEGGF QRARTAIPVG RPRLADEIDL LETTTLGLAD QLEALEREVR RRREQVVSQL RALGRERDFV SSLLDTARVL IVSQDAEGRI TLINDYAQAA LGRREDELVG AHFDAVFPGL VPIGRNSGLP REEERPLHSP GRGERIVAWY HAPLAAEEGR PAGRISVGLD ITERKAAEAR LIWLAQRDPL TELYNRRYFQ EALDRALAKG VHGAVLLMDL DQFRDVNELS GHHAGDELLR LVAGTLLDHL EHRGVIARLG GDEFALLLEE TDADAAVSVA QYMVKLLEDL GLSIGERRHR VSASIGIVLF PEHGETPTDL MASADVAMYK AKETGVQRWH LLHALDHAKG ELQERVYWVE QLHQALQGDA FELMVQPIVR LRDRSVRHYE VLVRMRDPSG ELLLPGRFIP FAEHSGQIVQ LDRWVLRAAL KVLRRVQSQG IGLAVNLSGQ SLHDDGLTTF LADELRASGA DPEHLILEIT ETAAVTDFST ARGVLEGIRA LGCQTALDDF GVGFSSFHYL GQLPVDYIKI DGSFIRSLPH NEDSRIIVRA IADIAAGFGK AAIAEFVDQE VLVPMLRDYG IAYGQGYHLG RPVPVEEAFG PA
|
| |