Gene Information |
Locus tag | EcolC_2751 |
Symbol | |
ID | 6065632 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | - |
Start bp | 3023589 |
End bp | 3024689 |
Gene Length | 1101 bp |
Protein Length | 366 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641602157 |
Product | late control D family protein |
Protein accession | YP_001725706 |
Protein GI | 170020752 |
COG category | [R] General function prediction only |
COG ID | [COG3500] Phage protein D |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.000516519 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
Sequence |
Gene sequence | GTGAATTTCA GCTCTGAACT GCTTAACAAA GGCAACAAAA CTCCGGCATT CAGCATCAGT ATTGAAGGTA GGGATATCAC CACTGTGCTG GACAACCGCC TGATGGGGCT GACGCTGACG GATAACCGGG GCTTTGACGC AGACCAGCTT GATCTGGAGC TGGACGACGC CGACGGAAAA ATCGTGCTGC CGCGCCGTGG TGCGGTCATT ACGCTGGCGC TGGGCTGGAA GGGGCAGCCG CTTTTCCCGA AAGGGGCATT CACGGTGGAC GAGATTGAAC ACACTGGCGC ACCGGACCGC CTGACTATCC GGGCGCGAAG TGCTGATTTT CGGGAAACGC TGAATACCCG TCGTGAAAAG TCGTGGCACA AGACCACCGT CGGGGAAGTG GTGAAGGAAA TAGCCGCGCG GCACAAGCTG AAGATGGCAC TGGGTAAAGA CCTGTCGGAT AAGCCCGTGG AGCATATAGA CCAGACTAAT GAGAGTGACG GCAGTTTTCT GATGCGGCTG GCGCGACAGT ACGGTGCCAT CGCGTCGGTG AAAAATGGCA ATCTGTTATT CATCCGGCAG GGGCAGGGCA AAAGCGCCAC TGGTAAACCA CTGCCAGTGA TCACTATCAC ACGCAAGGAC GGCGACAGTC ACCGCTTTAC CCTGGCAGAT CGCGGAGCCT ACACGGGCGT AATTGCCAGC TGGTTGCATA CCCGCGAACC TGCGAAGAAA GAAAGTACCA CGGTGAAGCG TAAGCGCAGA ACTAAGAAGC AGAAGAATGA GCCGGAAGCG AAGCAGGGCG ATTACCTGGT GGGTACGGAT GAAAACGTGC TGGTACTTAA TCGCACTTAT GCCAACCGGA GCAACGCTGA ACGGGCAGCG AAAATGCAGT GGGAACGCCT GCAACGCGGC GTTGCGTCAT TCTCGCTACA ACTGGCGGAA GGTCGGGCAG ATCTCTACAC GGAAATGCCT GTGAAGGTCA GCGGCTTTAA ACCGCCGATA GATGATGCGG AATGGACCAT TACGACTCTG ACACATACCG TCAGCCCGGA TAACGGTTTT ACGACCAGTC TGGAGCTTGA AGTGAGGATT GATGATTTCG AAATGGAATG A
Protein sequence | MNFSSELLNK GNKTPAFSIS IEGRDITTVL DNRLMGLTLT DNRGFDADQL DLELDDADGK IVLPRRGAVI TLALGWKGQP LFPKGAFTVD EIEHTGAPDR LTIRARSADF RETLNTRREK SWHKTTVGEV VKEIAARHKL KMALGKDLSD KPVEHIDQTN ESDGSFLMRL ARQYGAIASV KNGNLLFIRQ GQGKSATGKP LPVITITRKD GDSHRFTLAD RGAYTGVIAS WLHTREPAKK ESTTVKRKRR TKKQKNEPEA KQGDYLVGTD ENVLVLNRTY ANRSNAERAA KMQWERLQRG VASFSLQLAE GRADLYTEMP VKVSGFKPPI DDAEWTITTL THTVSPDNGF TTSLELEVRI DDFEME
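
The fields in this record can be cross-checked against each other. The following is a minimal Python sketch, not part of the source database, that recomputes the gene length from the start and end coordinates, derives the expected protein length, and translates the opening codons with the amino acid assignments of translation table 11. The helper names (`gene_length`, `translate`) are hypothetical, and `START_CODONS` lists only three common bacterial start codons as a simplifying assumption; table 11 permits others. The gene sequence as listed begins with GTG and ends with TGA, so it appears to be given in coding orientation even though the gene lies on the minus strand.

```python
# Coordinates copied from the record above.
START_BP = 3023589
END_BP = 3024689

# Amino acid assignments for NCBI translation table 11, with codons
# ordered TTT, TTC, TTA, TTG, TCT, ... (base order T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

# Assumption: only these start codons are handled; an initiating
# GTG or TTG is read as methionine rather than valine or leucine.
START_CODONS = {"ATG", "GTG", "TTG"}


def gene_length(start_bp, end_bp):
    """Length in bp of a feature given inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def translate(dna):
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored,
    and a trailing partial codon is dropped.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


length_bp = gene_length(START_BP, END_BP)
print(length_bp)           # 1101, matching "Gene Length"
print(length_bp // 3 - 1)  # 366, matching "Protein Length" (stop codon excluded)
print(translate("GTGAATTTCA GCTCTGAACT"))  # MNFSSE, the first six residues
```

Passing the full "Gene sequence" value to `translate` should reproduce the "Protein sequence" value once the spaces in the latter are removed. The sketch raises `KeyError` on ambiguous bases such as N, which do not occur in this record.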