Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcolC_2777 |
Symbol | |
ID | 6064863 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | + |
Start bp | 3043849 |
End bp | 3045615 |
Gene Length | 1767 bp |
Protein Length | 588 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 641602183 |
Product | hypothetical protein |
Protein accession | YP_001725732 |
Protein GI | 170020778 |
COG category | [S] Function unknown |
COG ID | [COG5484] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.000134113 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAACACCA CACTGACACC CGCAGATCTC GATCCCCGTC GGCAGGCCAT GCTGCTGTAC TTTCAGGGAT ACCGCGTAGC CCGCATAGCT GAAATGCTGG GCGAGAAAGT TGCAACCGTT CACAGCTGGA AAAAACGCGA CAAGTGGGGT GACTATGGGC CGCTGGATCA GATGCAGCTC ACCACCACCG CACGCTACTG CCAGCTCATC ATGAAGGAGC ACAAAGAAGG GAAAGATTTC AAAGAGATTG ACCTGCTGGC GCGCCAGTCG GAGCGCCACG CGCGGATCGG CAAGTTTAAC AATGGCGGCA ACGAAGCCGA CTTAAACCCT AACGTCGCCA ACCGCAACAA AGGCCCACGC CGTCAGCCGG AAAAGAATGT TTTCACCGAT GAGCAGATTG AGAAGCTGGA AGAAATCTTC CATTCCTCCA TGTTCAACTA CCAGCGCCAC TGGTGGGAAG CCGGAAAAAC CAACCGCATC CGCAACCTGC TGAAGTCACG CCAGATCGGC GCGACCTTTT ACTTTGCCCG TGAAGCCCTG ATTGACGCCC TGCTTACCGG GCGTAACCAG ATTTTCCTTT CCGCCAGTAA GGCACAGGCT CACGTTTTTA AGCAGTACAT CATCGACTTC GCCAAAGAAG TCGAGGTGGA GCTGAAAGGC GATCCGATGG TGCTTCCTAA CGGAGCCACG CTTTACTTCC TCGGCACCAA TGCCCGCACG GCCCAGAGTT ACCACGGCAA CCTGTATCTG GATGAATATT TCTGGATACC GAAATTCCAG GAACTGCGCA AAGTGGCTTC CGGTATGGCT ATTCACAAAA AATGGCGACA AACCTATTTT TCCACGCCAT CCAGCCTGAC ACACAGTGCT TATCCGTTCT GGTCCGGTGC GCTGTTCAAC CGTGGGCGCA ACAAAGCCGA TAAGGTGGAC ATCGACCTGT CCCACAGCAA TCTGGCCCCC GGCCTGCTGT GCGCAGACGG GCAATACCGC CAGATAGTCA CCGTGGAAGA TGCAGTGCGC GGCGGATGTA ACCTTTTCGA CCTTGACCAG TTGCGCATGG AGTACAGCCC GGACGAATAC CAGAACCTGC TGATGTGCGA GTTTGTGGAC GATCTCGCGT CCGTGTTTCC GCTCAGCGAG CTGCAGGCGT GCATGGTGGA CAGCTGGGAA GTCTGGACCG ACTTTCATGC ACTGGCTCTG CGCCCGTTTG GCTGGCGCGA AGTGTGGATC GGTTATGACC CGGCAAAAGG TACGCAAAAC GGCGACAGCG CCGGATGCGT GGTGGTGGCA CCGCCAGCCG TGCCGGGCGG TAAGTTCCGC ATTCTTGAGC GTCACCAGTG GCGCGGGATG GACTTCCGCG CCCAGGCTGA CGCCATCAAA AAACTGACCG AACAGTACAA CGTGACCTAT ATCGGTATCG ACTCGACCGG CGTTGGTCAC GGGGTTTATG AGAACGTGAA AGCGTTCTTT CCTGCCGTCC GGGAGTTTGT CTACAACCCC AACGTTAAAA ACGCCCTGGT ACTCAAGGCC TACGACATTA TCAGCCACCG CCGTCTGGAG TTTGACGCCG GGCACACCGA CATTGCGCAG TCATTCATGG CTATCCGTCG CGCCACCACC GCCAGCGGCA ACCGCCCGAC CTATGAAGCC AGCCGCAGCG AAGAAGCCAG CCACGCCGAT CTGGCCTGGG CAACAATGCA CGCACTGTTT AACGAACCGC TGCAGGGCGA GTCCGCCAAT ACCAGCAATA TTGTGGAGAT TTTTTGA
|
Protein sequence | MNTTLTPADL DPRRQAMLLY FQGYRVARIA EMLGEKVATV HSWKKRDKWG DYGPLDQMQL TTTARYCQLI MKEHKEGKDF KEIDLLARQS ERHARIGKFN NGGNEADLNP NVANRNKGPR RQPEKNVFTD EQIEKLEEIF HSSMFNYQRH WWEAGKTNRI RNLLKSRQIG ATFYFAREAL IDALLTGRNQ IFLSASKAQA HVFKQYIIDF AKEVEVELKG DPMVLPNGAT LYFLGTNART AQSYHGNLYL DEYFWIPKFQ ELRKVASGMA IHKKWRQTYF STPSSLTHSA YPFWSGALFN RGRNKADKVD IDLSHSNLAP GLLCADGQYR QIVTVEDAVR GGCNLFDLDQ LRMEYSPDEY QNLLMCEFVD DLASVFPLSE LQACMVDSWE VWTDFHALAL RPFGWREVWI GYDPAKGTQN GDSAGCVVVA PPAVPGGKFR ILERHQWRGM DFRAQADAIK KLTEQYNVTY IGIDSTGVGH GVYENVKAFF PAVREFVYNP NVKNALVLKA YDIISHRRLE FDAGHTDIAQ SFMAIRRATT ASGNRPTYEA SRSEEASHAD LAWATMHALF NEPLQGESAN TSNIVEIF
|
| |