Gene Information |
Locus tag | EcHS_A2084 |
Symbol | |
ID | 5594495 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 2070534 |
End bp | 2071682 |
Gene Length | 1149 bp |
Protein Length | 382 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 640921225 |
Product | putative baseplate J protein |
Protein accession | YP_001458769 |
Protein GI | 157161451 |
COG category | [S] Function unknown |
COG ID | [COG3299] Uncharacterized homolog of phage Mu protein gp47 |
TIGRFAM ID | |
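The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is only an illustration: it assumes the Start bp and End bp values are 1-based and inclusive (which matches the reported gene length) and that the CDS is complete, unspliced, and ends in a single stop codon. The function names `gene_length_bp` and `protein_length_aa` are hypothetical helpers, not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by a complete, unspliced CDS.

    Assumes the length counts one terminal stop codon, which is
    not translated into an amino acid.
    """
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values taken from the record above (locus tag EcHS_A2084).
length = gene_length_bp(2070534, 2071682)
print(length, protein_length_aa(length))  # prints "1149 382"
```

Both results agree with the Gene Length (1149 bp) and Protein Length (382 aa) fields; the strand does not affect the length calculation.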
|
|
Plasmid Coverage information |
Num covering plasmid clones | 43 |
Plasmid unclonability p-value | 0.733607 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCATTTA AGCGGAAAAC ACTGAGCGAG CTCCGGCAGG AGAATCGCCA GTTTATGCAG GCAGAGCTTG AAAGTGTTGG CGCGCTGTTA CGGTTTGGCA ACCTTAAGGT GCTCGCTGAT ATGGATGCGG GCATGGCCCA TCTGCACTAC GCCTACCTGG ATTATATTGC CCGTCAGAGC ACGCCTTTCA CCTCTACCGA TGAGTGGCTT GCCGGATGGA TGGCGCTAAA GCAGATTTAC CGAAAAGCCG CAACGGCAGC ACGCTCTCCG GCGGCGACCG TTACCGGAAC GCCGGGGAAA ACACTGTCAA AAGGGGCGGT GTTAAACCGT GATGATGGTT ACCAGTACAC AACCGATGAT GCCATAACGA TAAACACTGC TGGCAGTGCG TCGGTTGCTG TAACAGCGGT TTTGCCGGAT ATCACAGACG ATGTGACAGG AGGAGGCGCT TCCGGAAATG CCGATGCTGG CACCATTCTT ACACTGGATG CTAATGCCCC CGGCATAGAC AGCTCGGTTA CGCTGATTGA GCCCGCCACC GGCGGCGCCA ACATTGAAAG TGAAGAGGAT TTCCGATTAC GTGGTCTGCT GGCGTATCAG AATCCCCCGC AGGGCGGGAG TGACACTGAT TATAAAAGCT GGGCTTTATC CGTGTCGGGG ATTACCAGGG CATGGATACG GCGCCGGGGG ATGGGGCCGG GTACCGTGGT GATTTACATC ATGTGCGACG GCGATGATAA AACCAATCAT GGATTCCCTG TAGGAACTGA CGGTGTCTCT CAACTGGAAG AGTGGGGGGC TGTAAAAGCC ACCGGGGATC AGGGGAGAGT TGCCGATTAT ATGTATCCGC TTGCGCCGGT TACTTCCCTT AACTATGTCT GCTCTCCCAT CGAACGCGTT ATCGATTTTG AAATAGGCGG GATATCTGAT GCCGACAGCG CAACGACTGC GGCCATTGCT GATGCGATTG ACGGGGTATT GTTTGAATCC GCTAACCCGC TCGGCACAGG GAAAATTTAC CTTTCAGATC TCAACCGTGC GATAGGGGAT GTTGCCGGTA CTTCAGGTTA TATCCTTGTG TCGCCTTCTG CGAATATTGA GCCGGGAGTT GGGGAGCTGG CTGTTCGTGG TGAGGTGAAC TACACATGA
|
Protein sequence | MPFKRKTLSE LRQENRQFMQ AELESVGALL RFGNLKVLAD MDAGMAHLHY AYLDYIARQS TPFTSTDEWL AGWMALKQIY RKAATAARSP AATVTGTPGK TLSKGAVLNR DDGYQYTTDD AITINTAGSA SVAVTAVLPD ITDDVTGGGA SGNADAGTIL TLDANAPGID SSVTLIEPAT GGANIESEED FRLRGLLAYQ NPPQGGSDTD YKSWALSVSG ITRAWIRRRG MGPGTVVIYI MCDGDDKTNH GFPVGTDGVS QLEEWGAVKA TGDQGRVADY MYPLAPVTSL NYVCSPIERV IDFEIGGISD ADSATTAAIA DAIDGVLFES ANPLGTGKIY LSDLNRAIGD VAGTSGYILV SPSANIEPGV GELAVRGEVN YT