Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A2763 |
Symbol | |
ID | 5595377 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | + |
Start bp | 2782775 |
End bp | 2783998 |
Gene Length | 1224 bp |
Protein Length | 407 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 640921879 |
Product | hypothetical protein |
Protein accession | YP_001459398 |
Protein GI | 157162080 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG2199] FOG: GGDEF domain |
TIGRFAM ID | [TIGR00254] diguanylate cyclase (GGDEF) domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.000000512321 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATAACG ATAATTCTCT TAATAAGCGC CCCACGTTTA AAAGAGCATT ACGCAACATC AGTATCACCA GCATATTTAT CACTATGATG CTGATCTGGT TGCTGCTTTC CGTGACCTCG GTGCTGACCC TGAAACAGTA CGCGCAAAAA AACCTGGCAC TGACAGCAGC AACAATGACT TACAGTCTGG AAGCAGCTGT CGTTTTTGCC GATGGCCCTG CAGCAACTGA AACACTGGCA GCGCTGGGCC AGCAAGGGCA ATTTTCAACT GCAGAAGTAC GTGATAAGCA GCAAAATATT CTGGCGTCCT GGCATTACAC CCGTAAGGAT CCAGGCGATA CTTTCAGCAA TTTCATAAGC CACTGGCTCT TCCCCGCCCC CATCATTCAG CCGATTCGTC ACAATGGTGA AACCATTGGC GAAGTACGCT TAACCGCTCG CGACAGTTCA ATCAGCCATT TTATCTGGTT TTCGCTCGCC GTACTGACCG GTTGTATTCT GCTGGCATCA GGCATCGCAA TTACCCTCAC CCGCCATTTG CACAATGGCC TGGTGGAAGC ACTGAAAAAT ATCACCGATG TCGTACATGA TGTGCGTTCC AACCGCAATT TTTCCCGACG AGTTTCGGAA GAACGTATCG CTGAGTTTCA CCGCTTCGCT CTCGACTTCA ACAGTCTGCT GGATGAAATG GAAGAGTGGC AGCTTCGTTT ACAGGCTAAA AATGCGCAGC TTCTACGTAC CGCGCTACAT GACCCATTAA CCGGGCTGGC TAACCGCGCA GCGTTTCGTA GCGGCATCAA CACGTTGATG AACAATTCCG ATGCCCGAAA AACGTCGGCG TTACTATTTC TTGATGGCGA TAATTTCAAA TACATCAATG ATACCTGGGG TCATGCGACG GGCGATAGAG TCTTGATTGA AATCGCAAAA CGGTTAGCTG AAGTTGGCGG GCTGCGACAT AAAGCATACC GCCTGGGCGG CGATGAATTC GCTATGGTGC TCTATGATGT ACAGTCAGAA TCTGAAGTGC AGCAGATATG CTCAGCACTG ACACAAATCT TTAATCTCCC GTTTGATCTT CATAATGGTC ATCAGACCAC CATGACATTA AGCATTGGTT ACGCGATGAC CATTGAGCAC GCCTCTGCGG AAAAATTACA AGAGCTTGCC GATCACAATA TGTATCAGGC CAAACACCAG CGTGCCGAAA AGCTGGTGAG ATAA
|
Protein sequence | MDNDNSLNKR PTFKRALRNI SITSIFITMM LIWLLLSVTS VLTLKQYAQK NLALTAATMT YSLEAAVVFA DGPAATETLA ALGQQGQFST AEVRDKQQNI LASWHYTRKD PGDTFSNFIS HWLFPAPIIQ PIRHNGETIG EVRLTARDSS ISHFIWFSLA VLTGCILLAS GIAITLTRHL HNGLVEALKN ITDVVHDVRS NRNFSRRVSE ERIAEFHRFA LDFNSLLDEM EEWQLRLQAK NAQLLRTALH DPLTGLANRA AFRSGINTLM NNSDARKTSA LLFLDGDNFK YINDTWGHAT GDRVLIEIAK RLAEVGGLRH KAYRLGGDEF AMVLYDVQSE SEVQQICSAL TQIFNLPFDL HNGHQTTMTL SIGYAMTIEH ASAEKLQELA DHNMYQAKHQ RAEKLVR
|
| |