Gene Information |
Locus tag | EcSMS35_1373 |
Symbol | |
ID | 6143788 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 1360068 |
End bp | 1361666 |
Gene Length | 1599 bp |
Protein Length | 532 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641616251 |
Product | cyclic diguanylate phosphodiesterase domain-containing protein |
Protein accession | YP_001743431 |
Protein GI | 170681526 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG2200] FOG: EAL domain |
TIGRFAM ID | |
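The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length; the function name `cds_lengths` is hypothetical and not part of the record.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumption: 1-based inclusive coordinates and one terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon is the stop codon, so it contributes no amino acid.
    protein_len = gene_len // 3 - 1
    return gene_len, protein_len


# Values taken from the Start bp and End bp fields of this record.
gene_len, protein_len = cds_lengths(1360068, 1361666)
print(gene_len, protein_len)  # 1599 532
```

Under these assumptions the result agrees with the Gene Length (1599 bp) and Protein Length (532 aa) fields.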
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.295034 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 0.0183228 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGCAAAAAG CACAACGGAT CATTAAAACC TATCGCCGTA ATCGAATGAT TGTTTGTACG ATTTGCGCCC TCGTTACGCT CGCTTCGACC CTGAGCGTGC GATTTATTTC ACAGCGTAAC TTAAATCAAC AACGGGTAGT ACAATTCGCC AATCACGCTG TAGAGGAATT AGATAAAGTA CTGCTTCCCC TACAGGCAGG TAGCGAAGTC TTGCTTCCGC TGATTGGTCT GCCCTGCTCT GTCGCCCATT TGCCATTACG TAAACAGGCG GCAAAACTCC AAACTGTGCG ATCCATTGGC CTGGTGCAAG ACGGCACACT TTATTGCTCC AGCATTTTTG GTTATCGCAA TGTGCCCGTC GTGGACATTC TGGCTGAACT TCCTGCACCG CAACCACTTT TACGCCTGAC GATCGACCGT GCCCTGATTA AAGGCAGTCC GGTTTTGATT CAGTGGACGC CAGCAGCGGG CAGTAGCAAA GCTGGGGTCA TGGAGATGAT TAACATCGAC TTACTGGCGG CAATGCTGCT TGAGCCACAA CTGCCGCAAA TCAGTAGCGC CAGCCTGACG GTGGACGATC GGCATTTGCT CTATGGTAAT GGGCTGGTAG ATTCCCTTCC GCAACCTGAA AACAATGAAA ACTACCAGGT TTCTTCGCAA CGCTTTCCTT TTACCATTAA CGTTAATGGT CCGGGGGCTA CGGCGCTGGC ATGGCACTAT CTTCCAACAC AATTACCGCT GGCGGTGCTG CTAAGTTTAC TGGTGGGCTA CATCGCCTGG CTGGCGACCG CTTACCGGAT GAGCTTTTCC CGCGAAATCA ATCTGGGCCT GGCGCAACAT GAGTTCGAAT TGTTCTGTCA GCCTTTGCTT AATGCGCGCA GCCAGCAATG TATTGGTGTA GAGATTTTGC TTCGCTGGAA CAATCCGCGT CAGGGCTGGA TTTCACCGGA TGTGTTTATT CCTATCGCGG AAGAACATCA TTTAATTGTG CCACTGACCC GCTATGTGAT GGCAGAAACC ATTCGTCAGC GCCATGTTTT CCCGATGAGT AGTCAGTTTC ATGTTGGCAT TAACGTCGCA CCCAGCCATT TTCGCCGTGG TGTGCTGATA AAAGATCTCA ATCAGTACTG GTTTAGCGCT CACCCGATTC AGCAACTGAT CCTCGAAATC ACCGAACGCG ATGCCTTACT GGATGTTGAT TATCGGATTG CCCGCGAGCT ACATCGTAAA AACGTCAAAC TGGCGATTGA TGACTTCGGC ACCGGCAACA GTTCATTTTC CTGGCTTGAA ACATTACGTC CTGACGTGCT GAAAATTGAT AAGTCATTTA CCGCAGCTAT AGGTTCTGAC GCGGTTAACT CGACGGTGAC CGATATCATC ATCGCTCTGG GGCAAAGACT GAATATTGAA CTGGTGGCGG AGGGTGTGGA AACACAAGAA CAGGCGAAGT ATTTGCGCCG TCATGGGGTG CATATTTTGC AAGGGTATTT GTACGCACAG CCGATGCCGC TACGTGATTT TCCCAAATGG CTGGCGGGCA GCCAACCGCC GCCCGCCCGG CATAATGGAC ATATCACGCC CGTTATGCCG TTACGTTAA
Protein sequence | MQKAQRIIKT YRRNRMIVCT ICALVTLAST LSVRFISQRN LNQQRVVQFA NHAVEELDKV LLPLQAGSEV LLPLIGLPCS VAHLPLRKQA AKLQTVRSIG LVQDGTLYCS SIFGYRNVPV VDILAELPAP QPLLRLTIDR ALIKGSPVLI QWTPAAGSSK AGVMEMINID LLAAMLLEPQ LPQISSASLT VDDRHLLYGN GLVDSLPQPE NNENYQVSSQ RFPFTINVNG PGATALAWHY LPTQLPLAVL LSLLVGYIAW LATAYRMSFS REINLGLAQH EFELFCQPLL NARSQQCIGV EILLRWNNPR QGWISPDVFI PIAEEHHLIV PLTRYVMAET IRQRHVFPMS SQFHVGINVA PSHFRRGVLI KDLNQYWFSA HPIQQLILEI TERDALLDVD YRIARELHRK NVKLAIDDFG TGNSSFSWLE TLRPDVLKID KSFTAAIGSD AVNSTVTDII IALGQRLNIE LVAEGVETQE QAKYLRRHGV HILQGYLYAQ PMPLRDFPKW LAGSQPPPAR HNGHITPVMP LR
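As a rough illustration of how the gene sequence relates to the protein sequence and the GC content field, the following sketch strips the spaces between the 10-base blocks, translates codon by codon, and computes the GC fraction. It assumes the gene sequence is shown 5' to 3' on the coding strand (it begins with ATG even though the gene lies on the minus strand). The amino-acid assignments are those of the standard code, which translation table 11 shares; table 11 also permits alternative start codons, which this sketch does not handle. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G). Assumption: table 11 uses
# the same amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the whitespace between sequence blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq):
    """Translate in frame from the first base, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base blocks of the gene sequence in this record.
first_blocks = "ATGCAAAAAG CACAACGGAT CATTAAAACC"
print(translate(first_blocks))  # MQKAQRIIKT
```

The translated fragment matches the first block of the protein sequence. Passing the full gene sequence to `gc_fraction` and `translate` would allow the GC content and Protein sequence fields to be checked in the same way.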