Gene Information |
Locus tag | EcolC_2756 |
Symbol | |
ID | 6065615 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | - |
Start bp | 3029254 |
End bp | 3030426 |
Gene Length | 1173 bp |
Protein Length | 390 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 641602162 |
Product | tail sheath protein |
Protein accession | YP_001725711 |
Protein GI | 170020757 |
COG category | [R] General function prediction only |
COG ID | [COG3497] Phage tail sheath protein FI |
TIGRFAM ID | |
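The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon that is not counted in the protein length; the function name `cds_lengths` is hypothetical and not part of the record.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon is the stop codon, which does not encode an amino acid.
    protein_len = gene_len // 3 - 1
    return gene_len, protein_len


gene_len, protein_len = cds_lengths(3029254, 3030426)
print(gene_len, protein_len)  # 1173 390
```

Under those assumptions the result agrees with the listed Gene Length (1173 bp) and Protein Length (390 aa).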
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.000512477 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGCTCAGG ATTACCACCA CGGAGTGCGC GTTGTTGAAG TCAACGAAGG CACCCGATCC ATTACCACGG TGAGCACCGC CATCGTGGGT ATGGTCTGCA CGGGCGATGA TGCCGATGCA AAAATGTTTC CTCTTAATAA ACCCGTGCTG ATCACTGATG TGCTGACTGC CAGCGGTAAA GCGGGTGAGT CCGGCACGCT GGCCCGTTCG CTGGATGCCA TCGCTGACCA GGCAAAACCC GTGACCGTTG TTGTGCGTGT GCCGCAGGGG GAAACGGAAG AAGAAACCAC GACCAATATC ATCGGCGCAG TGACTGCTGA AGGTAAAAAA ACAGGCATGA AAGCTCTGTT ATCTGCCCAG TCACAGCTCG GCGTTAAACC GCGCATTCTC GGCGTGCCAG GCCACGATAA CAAAGCCGTT GCGACTGAGT TGCTGAGCGT GGCGCAAAGC CTGCGTGGGT TTGCTTACCT GTCAGCGTAT GGCTGCAAGA CAGTGCAGGA GGCGATCACT TACCGCGAAA ACTTCAGCCA GCGCGAAGGG ATGCTGATCT GGCCCGACTT TACTGGCTGG GACACGGTGC TGAATGCCGA AGCAACGGCT TATGCCACCG CTCGTGCGCT TGGTCTGCGC GCCAAAATTG ACGAGCAGAC CGGATGGCAC AAAAGCCTGT CCAACGTGGG CGTGAACGGT GTCACCGGAA TTTCTGCTGA TGTGTTCTGG GATCTGCAGG ACCCGGCAAC CGATGCAGGT CTGCTGAACC AGAACGACGT CACCACGCTT GTGCGTAAAG ACGGTTTCCG CTTCTGGGGT TCCCGCAGCC TGAGTGATGA CCCGCTCTTT GCCTTCGAAA ACTACACCCG CACGGCGCAG GTGCTGATGG ACACGATGGC AGAAGCACAC ATGTGGGCGG TGGATAAACC GCTTAACCCG TCGCTGGCCC GCGACATTAT CGAGGGCATC CGCGCCAAAA TGCGCAGCCT GGTCAGTCAG GGGTATCTCA TTGGTGGTGA TTGCTGGCTG GACGAGTCGG TGAACGACAA AGACACTCTG AAAGCCGGAA AACTCACCAT CGACTACGAC TACACGCCAG TGCCGCCACT TGAAAACCTG ATGTTGCGTC AGCGCATCAC CGATCAGTAC CTGGTGAATT TCTCCAGCCA GGTCAGCGCG TAA
|
Protein sequence | MAQDYHHGVR VVEVNEGTRS ITTVSTAIVG MVCTGDDADA KMFPLNKPVL ITDVLTASGK AGESGTLARS LDAIADQAKP VTVVVRVPQG ETEEETTTNI IGAVTAEGKK TGMKALLSAQ SQLGVKPRIL GVPGHDNKAV ATELLSVAQS LRGFAYLSAY GCKTVQEAIT YRENFSQREG MLIWPDFTGW DTVLNAEATA YATARALGLR AKIDEQTGWH KSLSNVGVNG VTGISADVFW DLQDPATDAG LLNQNDVTTL VRKDGFRFWG SRSLSDDPLF AFENYTRTAQ VLMDTMAEAH MWAVDKPLNP SLARDIIEGI RAKMRSLVSQ GYLIGGDCWL DESVNDKDTL KAGKLTIDYD YTPVPPLENL MLRQRITDQY LVNFSSQVSA
|
| |
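The sequence fields are printed in space-separated blocks of ten characters. The following is a minimal sketch of how they might be cleaned up and checked against each other; the helper names (`clean`, `gc_percent`, `translate`) are hypothetical. It assumes the gene sequence is listed in coding orientation, which is consistent with it beginning with ATG and ending with TAA even though the gene lies on the minus strand. The codon table is the standard one; translation table 11 uses the same amino acid assignments and differs in the start codons it permits, which this sketch does not model.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
print(translate("ATGGCTCAGG ATTACCACCA CGGAGTGCGC"))  # MAQDYHHGVR
```

The translated prefix matches the first block of the listed protein sequence. Passing the full gene sequence to `gc_percent` and `translate` would allow the same comparison against the GC content and Protein sequence fields.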