Gene Information |
Locus tag | EcSMS35_2756 |
Symbol | |
ID | 6146815 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 2837355 |
End bp | 2838578 |
Gene Length | 1224 bp |
Protein Length | 407 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 641617626 |
Product | hypothetical protein |
Protein accession | YP_001744787 |
Protein GI | 170683267 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG2199] FOG: GGDEF domain |
TIGRFAM ID | [TIGR00254] diguanylate cyclase (GGDEF) domain |
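The coordinate, gene length, and protein length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it agrees with the listed 1224 bp) and that the last codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 2837355
end_bp = 2838578

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of 3 bases; the final codon is the stop codon,
# which does not contribute an amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)      # 1224
print(protein_length)   # 407
```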
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.0689329 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 0.00382869 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
Sequence |
Gene sequence | ATGGATAACG ATAATTCTCT TCATAAGCGC CCCACGTTTA AAAGAGCATT ACGCAACATC AGTATGACCA GCATATTTAT CACTATGATG CTGATCTGGT TGCTGCTTTC CGTGACCTCG GTGCTGACCC TGAAACAGTA CGCGCAAAAA AACCTGGCAC TGACAGCAGC AACAATGACT TACAGTCTGG AAGCAGCGGT CGTTTTTGCC GATGGCCCTG CAGCAACTGA AACACTGGCA GCGCTGGGGC AGCAAGGCCA ATTTTCAACT GCAGAAGTGC GTGATAAGCA GCAAAATATT CTGGCATCCT GGCATTACAC CCGTAAGGAT CCAGGCGATA CTTTCAGTAA TTTCATAAGC CACTGGCTCT TCCCTGCCCC CATCATTCAG CCGATTCGTC ACAATGGTGA AACCATTGGC GAAGTTCGCT TAACCGCTCG CGACAGTTCA ATCAGCCATT TCATCTGGTT TTCGCTCGCT GTCCTGACCG CCTGCATTCT GCTAGCGTCA GGCATCGCAA TTACCCTCAC CCGCCATTTG CACAATGGTC TGGTTGAAGC GTTAAAAAAC ATCACCGATG TCGTACATGA TGTGCGTTCC AACCGCAATT TTTCCCGACG AGTTTCGGAA GAACGTATCG CAGAGTTTCA CCGCTTCGCT CTCGACTTCA ACAGTCTGCT GGATGAAATG GAAGAGTGGC AGCTTCGTTT ACAGGCCAAA AATGCGCAGC TTCTACGTAC CGCACTACAT GACCCATTAA CCGGGCTGGC TAACCGCGCA GCGTTTCGTA GCGGTATCAA CACGTTGATG AACAGTTCCG ATGCCCGAAA AACGTCGGCG TTACTATTTC TTGATGGCGA TAATTTCAAA TATATCAATG ATACCTGGGG TCATGCGACG GGCGATAGAG TCTTGATTGA AATAGCAAAA CGGTTAACTG AATTTGGCGG GCTGCGACAT AAAGCATACC GCCTGGGCGG CGATGAATTC GCTATGGTGC TCTATGATGT ACAATCGGAA TCTGAAGTGC AGCAGATATG CTCAGCACTG ACACAAATTT TTAATCTCCC GTTTGATCTT CATAATGGCC ATCAGACTAC CATGACATTA AGCATTGGTT ACGCGATGAC CATTGAGCAC GCCTCTGCGG AAAATTTACA AGAGCTTGCC GATCACAATA TGTATCAGGC CAAACACCAG CGTGCCGAAA AGCTGGTGAG ATAA
Protein sequence | MDNDNSLHKR PTFKRALRNI SMTSIFITMM LIWLLLSVTS VLTLKQYAQK NLALTAATMT YSLEAAVVFA DGPAATETLA ALGQQGQFST AEVRDKQQNI LASWHYTRKD PGDTFSNFIS HWLFPAPIIQ PIRHNGETIG EVRLTARDSS ISHFIWFSLA VLTACILLAS GIAITLTRHL HNGLVEALKN ITDVVHDVRS NRNFSRRVSE ERIAEFHRFA LDFNSLLDEM EEWQLRLQAK NAQLLRTALH DPLTGLANRA AFRSGINTLM NSSDARKTSA LLFLDGDNFK YINDTWGHAT GDRVLIEIAK RLTEFGGLRH KAYRLGGDEF AMVLYDVQSE SEVQQICSAL TQIFNLPFDL HNGHQTTMTL SIGYAMTIEH ASAENLQELA DHNMYQAKHQ RAEKLVR
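The relationship between the gene sequence and the protein sequence above can be illustrated with a minimal translation sketch. It assumes the amino acid assignments of NCBI translation table 11, which match the standard genetic code. Alternative start codons, which table 11 allows, are not handled here, and this gene begins with ATG in any case. The helper names `translate` and `gc_percent` are hypothetical and are not part of any database tooling.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (table 11 amino acid
# assignments are identical to the standard code).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip(("".join(c) for c in product(BASES, repeat=3)), AMINO_ACIDS)
)


def translate(dna, table=CODON_TABLE):
    """Translate a DNA string, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = table.get(seq[i:i + 3], "X")  # "X" marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna):
    """Return the percentage of G and C bases, ignoring whitespace."""
    seq = "".join(dna.split()).upper()
    if not seq:
        return 0.0
    return 100.0 * sum(seq.count(b) for b in "GC") / len(seq)


# The first three 10-base blocks of the gene sequence above.
print(translate("ATGGATAACG ATAATTCTCT TCATAAGCGC"))  # MDNDNSLHKR
```

Passing the full gene sequence to `translate` should give a string comparable to the listed protein sequence once the spaces are removed from the latter, and `gc_percent` can be compared against the GC content field in the same way.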