Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A0942 |
Symbol | |
ID | 5593652 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | + |
Start bp | 941052 |
End bp | 942224 |
Gene Length | 1173 bp |
Protein Length | 390 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 640920112 |
Product | major tail sheath protein |
Protein accession | YP_001457679 |
Protein GI | 157160361 |
COG category | [R] General function prediction only |
COG ID | [COG3497] Phage tail sheath protein FI |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 57 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTCAGG ATTACCACCA CGGAGTGCGC GTTGTTGAAG TCAACGAAGG CACTCGATCC ATTACTACGG TGAGCACCGC CATCGTGGGC ATGGTCTGCA CGGGCGATGA TGCCGATGCA AAAATGTTCC CTCTTAATAA ACCCGTGCTG ATCACTGATG TGCTTACTGC CAGCGGTAAA GCGGGTGAGT CAGGTACGCT GGCCCGTTCG CTGGATGCCA TCGCTGACCA GGCAAAACCC GTGACCGTTG TTGTGCGTGT GCCGCAGGGT GAAACGGAAG AAGAAACCAC GACCAATATC ATCGGAGCAG TGACCGCTGA AGGTAAAAAA ACAGGCATGA AAGCCCTGTT ATCTGCCCAG TCACAGCTCG GCGTTAAACC GCGCATTCTC GGCGTGCCAG GCCACGACAC CAAGGCGGTA GCTACTGAGT TGCTGAGCGT GGCGCAAAGC CTGCGTGGAT TTGCTTACCT GTCAGCGTAT GGCTGCAAGA CGGTACAGGA GGCGATCACT TACCGTGAAA ACTTCAGCCA GCGCGAAGGG ATGCTGATCT GGCCTGACTT TACTGGCTGG GACACGGTGC TGAATGCCGA AGCAACGGCA TATGCCACCG CCCGTGCGCT TGGTCTGCGC GCCAAAATTG ACGAGCAGAC CGGATGGCAC AAAAGCCTGT CCAACGTGGG CGTGAACGGT GTCACCGGAA TTTCTGCTGA TGTGTTCTGG GATCTGCAGG ACCCGGCAAC CGATGCAGGT CTGCTGAACC AGAACGACGT CACCACGCTT GTGCGTAAAG ACGGTTTCCG CTTCTGGGGT TCCCGCTGCC TGAGTGATGA CCCGCTCTTT GCCTTCGAAA ACTACACCCG CACGGCGCAG GTGCTGATGG ACACGATGGC AGAAGCACAC ATGTGGGCGG TGGATAAACC GCTTAACCCG TCGCTGGCCC GCGACATTAT CGAGGGGATC CGCGCCAAAA TGCGCAGCCT GGTCAGTCAG GGCTATCTCA TTGGTGGTGA TTGCTGGCTG GATGAGTCGG TGAACGACAA AGACACGCTG AAAGCCGGAA AACTCACCAT CGACTACGAC TACACGCCAG TGCCGCCACT TGAAAACCTG ATGCTGCGTC AGCGCATCAC CGATCAGTAC CTGGTGAATT TCTCTAGCCA GGTCAGCGCG TAA
|
Protein sequence | MAQDYHHGVR VVEVNEGTRS ITTVSTAIVG MVCTGDDADA KMFPLNKPVL ITDVLTASGK AGESGTLARS LDAIADQAKP VTVVVRVPQG ETEEETTTNI IGAVTAEGKK TGMKALLSAQ SQLGVKPRIL GVPGHDTKAV ATELLSVAQS LRGFAYLSAY GCKTVQEAIT YRENFSQREG MLIWPDFTGW DTVLNAEATA YATARALGLR AKIDEQTGWH KSLSNVGVNG VTGISADVFW DLQDPATDAG LLNQNDVTTL VRKDGFRFWG SRCLSDDPLF AFENYTRTAQ VLMDTMAEAH MWAVDKPLNP SLARDIIEGI RAKMRSLVSQ GYLIGGDCWL DESVNDKDTL KAGKLTIDYD YTPVPPLENL MLRQRITDQY LVNFSSQVSA
|
| |