Gene Information |
Locus tag | EcolC_0606 |
Symbol | |
ID | 6065337 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | + |
Start bp | 648220 |
End bp | 649308 |
Gene Length | 1089 bp |
Protein Length | 362 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 641600012 |
Product | CblD family pilus biogenesis initiator protein |
Protein accession | YP_001723609 |
Protein GI | 170018655 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGAAATC GATTGATTGC GGCGATATTG GGCTTGTGTG GTGCGGTTAC TGGCGTTCAG GCAGCTCCTA ACGTGACCAG TGAAATTACG TACGATTTGG CATCTGGCAG AGCGGATTAT TACTTCTGGA ATGAGGAGCC CCCACCGGAG GTGAGTTACA GTACAACATT TTCATTTTTT CAATGTAGCT ACCCTGATTC ACAGCAGACT TGTACATCAG CAGGTAATAC TTCTGTCGTG CAAATTTATC TAACTGAAAA ACGCAGCGGT ATGCGCTGGC CGGTTAAACT GAAAGGGTAT ATGACAGTTC AGGTGTGGGA GGACGGACCG TGTAAGGGGT GGTACGATAA GAAAAGGCTG GATGATGGGA CGGGTTATCA ATGTAAAGAT ACGATTAATA ACGTTGGTTA TCTGGCTAAA ACAAAAGTTT TAACTCTGTA TATTGAGCAA GAAGAAATGA AGAAACTGCC GATTGGCGGT TTATGGGAAG GGAAAGTTAA ACTCCATTTT AGCTACCCGG CAACAGATTA TCAGGCTGAT ATTAAGCTTA ATGTTCTCGA CCCCAACCAT ATCGACGTGT TCTTCCCGGA GTTCGCCCAC GCCACGCCAC GGGTGCAGTT AGACTTGCAT CCAACAGGGA GCGTTAATGG CAGCAACTAC GCGCAAGATC TGACCATGTT GGACATGTGC TTGTACGATG GTTTTAACGG TAATGCCATC AGTTATGAAA TCATGCTCAA AGATGAAGGG CGACCCGCCG CAGGGCGCAG AGACGGTGAC TTCTCTATCT ATCGTCAGGG AGGAACCACC ACCGACGAGG GAGAACGCAT TGATTACCGG GTCAAAATGT ACAACCCGGA AACCGGTGGG CAAATTGATG TGCGCAATAA TGAAAATATG GTCTGGAACA GCATTAACCT GAAACGTGTG CGTCCGGTCG TATTGCCAGG GATCCGCTAT GCCGTAATGT GTGTGCCAAC GCCATTAACG CTGGCAGTAG AAAAATTTAG CGTGATGGAC AAACAGGCTG GATATTACAT GGGGAAATTG TCGGTAATCT TTACGCCTTC CTTGCCAACC ATCAATTAA
|
Protein sequence | MRNRLIAAIL GLCGAVTGVQ AAPNVTSEIT YDLASGRADY YFWNEEPPPE VSYSTTFSFF QCSYPDSQQT CTSAGNTSVV QIYLTEKRSG MRWPVKLKGY MTVQVWEDGP CKGWYDKKRL DDGTGYQCKD TINNVGYLAK TKVLTLYIEQ EEMKKLPIGG LWEGKVKLHF SYPATDYQAD IKLNVLDPNH IDVFFPEFAH ATPRVQLDLH PTGSVNGSNY AQDLTMLDMC LYDGFNGNAI SYEIMLKDEG RPAAGRRDGD FSIYRQGGTT TDEGERIDYR VKMYNPETGG QIDVRNNENM VWNSINLKRV RPVVLPGIRY AVMCVPTPLT LAVEKFSVMD KQAGYYMGKL SVIFTPSLPT IN
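The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in Protein Length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the record above.
START_BP = 648220
END_BP = 649308
GENE_LENGTH_BP = 1089
PROTEIN_LENGTH_AA = 362


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


def ungrouped(sequence: str) -> str:
    """Remove the whitespace that splits a sequence into blocks of ten."""
    return "".join(sequence.split())


# Both checks pass for the values listed in this record.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

The same `ungrouped` helper can be applied to the Gene sequence and Protein sequence fields before comparing their lengths with Gene Length and Protein Length.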