Gene Information |
Locus tag | EcE24377A_2218 |
Symbol | |
ID | 5586194 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli E24377A |
Kingdom | Bacteria |
Replicon accession | NC_009801 |
Strand | - |
Start bp | 2183402 |
End bp | 2184307 |
Gene Length | 906 bp |
Protein Length | 301 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 640925886 |
Product | phage baseplate assembly protein V |
Protein accession | YP_001463286 |
Protein GI | 157157570 |
COG category | [R] General function prediction only |
COG ID | [COG4540] Phage P2 baseplate assembly protein gpV |
TIGRFAM ID | [TIGR01644] phage baseplate assembly protein V |
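The length fields in this record can be cross-checked against the coordinates. The sketch below assumes that Start bp and End bp are 1-based and inclusive (the record does not state the convention) and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive, 1-based coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The terminal stop codon does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


# Values taken from the record above.
length = gene_length_bp(2183402, 2184307)
print(length, protein_length_aa(length))
```

With the values from this record, the computed lengths agree with the Gene Length (906 bp) and Protein Length (301 aa) fields.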
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.033632 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCCAAT CGGACGATGA TGAATACGCC GCAGCGGAGA ACGCGCGCAG GCTGCGTGAC GCGGTTAAAC GCGGCACGAT AGCTGCAGTT CAAATGAAAC CCCCTCGCTG TCGGGTCTCA TTTGGCGGCG AACACCAGTC GGGTTGGCTG CAATGGTTTA CTCACGCAAC GTCCGAGCGA ATCGACTGGA GCGCACCATC AGTAGGTGAC CCCGTTACTG TTGTTTCAGA GGGTGGGGAC ACGCGGAACG GCGTAGTTAT GCTCGGACTG CACATTGACA ACGTAGATCC GCCCAGTAGC GACCCCCATG ATCACGTCAC TGCGTATTGT GACGGGGCCA GGCTGACGTA TAACACGAAA AACCACACGC TGACCTGGCA GGGCGTACCG GACGGCGTAG TAAAAATACT CGGTGAGTCT GAAATAGAAA TATTCGGACG TGCAGACGTT ACTATTAATA GCGAAAACGT TGTCAATATT CACGGTGGAA AATTAATTAA CGCGGACGCT GATATTATTA ATGTCACAGC GACAGATACA ATTAACGCAC ATGCTGATTT AGTGAACGTT ATAGCAACGA GTTCTGTTAG TGTTACTGCT GCGAACAGAA TATCGCTGAC AGCTCAAACA ATCAGTGCGT GGGCTCCGGG TGGGATAACA CTAGCTGGTC CAACTCATAT TACCGAGACA TTAGTAGTAG ATAAATTAGC GACATTCCGT AACGATATTT CTGTCACTGG AGATAACGGT GGAACCGGTA ATATCACAAC TCGCGGTAGT GTGTTAGCAG GGCAAGATGT GCAGGACCGA CAAGGCACGA TGAACGAGAT GCGCATTACA TATAACGGGC ATGTGCACAC CTGTCCTGAC GGCGAAACAG ATAAACCAAA TCAACAAATG GCGTAA
|
Protein sequence | MAQSDDDEYA AAENARRLRD AVKRGTIAAV QMKPPRCRVS FGGEHQSGWL QWFTHATSER IDWSAPSVGD PVTVVSEGGD TRNGVVMLGL HIDNVDPPSS DPHDHVTAYC DGARLTYNTK NHTLTWQGVP DGVVKILGES EIEIFGRADV TINSENVVNI HGGKLINADA DIINVTATDT INAHADLVNV IATSSVSVTA ANRISLTAQT ISAWAPGGIT LAGPTHITET LVVDKLATFR NDISVTGDNG GTGNITTRGS VLAGQDVQDR QGTMNEMRIT YNGHVHTCPD GETDKPNQQM A
|
| |
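The Gene sequence and Protein sequence fields can be checked against each other by translating the nucleotide sequence. The following is a minimal sketch, not the database's own method. It assumes the Gene sequence is given 5'→3' on the coding strand (it begins with ATG even though the gene lies on the minus strand) and that the 10-character groups are separated only by whitespace. For translation table 11 the codon-to-amino-acid assignments are the same as in the standard code; only the set of permitted start codons differs, which does not matter here because the gene starts with ATG. All function names are hypothetical.

```python
from itertools import product

# Codon assignments in NCBI order (base order T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-character groups and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding-strand CDS, stopping at the first stop codon."""
    dna = clean(dna)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))
    return protein.split("*")[0]


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three groups of the Gene sequence field above.
print(translate("ATGGCCCAAT CGGACGATGA TGAATACGCC"))
```

Passing the full Gene sequence field to `translate` and `gc_fraction` gives values that can be compared with the Protein sequence and GC content fields of this record.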