Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_1781 |
Symbol | |
ID | 6147213 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 1798598 |
End bp | 1799890 |
Gene Length | 1293 bp |
Protein Length | 430 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 641616657 |
Product | sensory box-containing diguanylate cyclase |
Protein accession | YP_001743835 |
Protein GI | 170681443 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG2199] FOG: GGDEF domain |
TIGRFAM ID | [TIGR00229] PAS domain S-box [TIGR00254] diguanylate cyclase (GGDEF) domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.900668 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 51 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGTGGC TTTTTTGCGA TCGGATAGCA ACAAAAATTG ATAAAAATAA CGGGATCTCA ATGATTACGC ACAACTTCAA TACCCTGGAC TTACTCACCA GTCCTGTCTG GATCGTTTCG CCCTTTGAGG AACAGTTAAT TTATGCCAAC AGCGCGGCGC GACTGTTGAT GCAAGACCTC ACGTTTAGTC AGCTACGAAC CGGTCCCTAT TCCGTCTCCT CACAAAAAGA ACTGCCGAAA TACCTCTCCG ATCTGCAAAA CCAACACGAT ATTATCGAAA TCCTCACCGT TCAACGTAAA GAAGAGGAAA CAGCATTAAG CTGTCGGCTT GTTTTGCGAG AGCTGACAGA AGCAGAACCG GTGATTATTT TCGAAGGTAT CGAAGCGCCG GCAACGCTGG GTTTAAAAGC CAGTCGCTCG GCAAATTATC AGCGCAAAAA ACAAGGTTTT TATGCGCGCT TTTTTCTGAC TAACTCTGCA CCAATGTTGT TGATTGACCC CTCACGAGAT GGCCAAATCG TTGATGCTAA CCTCGCCGCG CTCAATTTCT ATGGTTATAA CCATGAAACG ATGTGCCAGA AACATACCTG GGAAATAAAT ATGCTCGGGC GACGCGTCAT GCCTATCATG CATGAAATTT CGCATTTACC CGGTGGCCAT AAGCCTTTGA ATTTTGTTCA TAAATTGGCG GATGGTTCGA CTCGTCATGT GCAGACCTAT GCCGGTCCGA TTGAAATTTA TGGCGACAAG CTCATGTTAT GTATTGTGCA TGATATTACT GAGCAAAAAC GGCTGGAGGA GCAGCTGGAA CATGCTGCCC ACCATGACGC GATGACCGGA TTACTGAATC GGCGACAGTT TTATCACATT ACCGAGCCAG GCCAAATGCA GCACCTCGCC ATCGCTCAGG ATTACAGCTT GTTGCTCATC GACACCGATC GTTTTAAACA CATTAACGAT CTCTATGGGC ATTCTAAAGG TGATGAGGTG TTATGCGCCC TCGCCCGCAC CCTCGAAAGT TGCGCTCGCA AAGGCGATTT GGTGTTTCGT TGGGGAGGCG AAGAGTTTGT CTTATTGCTA CCAAGAACCC CACTGGATAC CGCGCTTTCG CTGGCTGAAA CTATCCGCGT AAGCGTGGCA AAAGTGAGTA TTTCGGGCTT ACCACGCTTT ACCGTCAGCA TTGGTGTGGC GCATCACGAA GGAAATGAAA GCATCGATGA GCTGTTTAAA CGCGTTGATG ATGCTTTGTA TCGGGCGAAA AATGATGGAC GCAACCGCGT GCTGGCGGCA TAA
|
Protein sequence | MTWLFCDRIA TKIDKNNGIS MITHNFNTLD LLTSPVWIVS PFEEQLIYAN SAARLLMQDL TFSQLRTGPY SVSSQKELPK YLSDLQNQHD IIEILTVQRK EEETALSCRL VLRELTEAEP VIIFEGIEAP ATLGLKASRS ANYQRKKQGF YARFFLTNSA PMLLIDPSRD GQIVDANLAA LNFYGYNHET MCQKHTWEIN MLGRRVMPIM HEISHLPGGH KPLNFVHKLA DGSTRHVQTY AGPIEIYGDK LMLCIVHDIT EQKRLEEQLE HAAHHDAMTG LLNRRQFYHI TEPGQMQHLA IAQDYSLLLI DTDRFKHIND LYGHSKGDEV LCALARTLES CARKGDLVFR WGGEEFVLLL PRTPLDTALS LAETIRVSVA KVSISGLPRF TVSIGVAHHE GNESIDELFK RVDDALYRAK NDGRNRVLAA
|
| |