Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A0237 |
Symbol | |
ID | 5595040 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 256897 |
End bp | 258372 |
Gene Length | 1476 bp |
Protein Length | 491 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 640919424 |
Product | hypothetical protein |
Protein accession | YP_001457011 |
Protein GI | 157159693 |
COG category | [S] Function unknown |
COG ID | [COG3517] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | [TIGR03355] type VI secretion protein, EvpB/VC_A0108 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 68 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTTTAC AGGAAGAGGA ACTCGTTTCC AGTCATGCCG GGCAGCCTGA GCAGGCATCG TCGCTGCTTG ATCAAATCAT GGCACAGACC CGTATTCAGC CCGGTTCGGA AGGTTATGAC GTTGCCCGCC AGGGTGTGAC AGCCTTTATT GCCAGCATCC TGCAATCCAC TGCATCTGCC GAACCGGTCA ATAAACTGGC CGTGGACAGC ATGATTGCAG ATATTGATGA ACGCATCAGC CGCCAGATGG ATGTCATCAT CCATGCGCCT GCCTTTCAGC AGGTCGAATC GTTCTGGCGC TCACTGAAAA CCATGGTGGA TCGCGTTGAT TTCCGTGAAA ACATCAAGGT CAACGTGCTT CATGTCACCA AGCAGGAACT TCTGGAAGAC TTTGAGTTTG CGCCAGAAAT TATTCAGTCC GGATTTTATA AGCACGTTTA TTCGTCCGGT TTCGGACAGT TTGGCGGTGA ACCCATCGCT GCCGTGCTGG GCGCTTATGA GTTCAAAAAT ACCGCGCCTG ACATGAAACT GCTGCAATAC GTCAGTGCCG TGGGCGCAAT GGCGCACGCA CCGTTCCTGT CCTCCGTTTC CCCGGAGTTT ATGGGGCTGA ACTCGTGGAC CGAACTGCCT AATATCAAAG ATCTTTATGC CATCTTTGAA GGTCCGGCCT ACACCAAATG GCGCGCCCTG CGTGACTCGG AAGATTCGCG CTATTTAGGG CTTACGGCAC CCCGCTTCCT GCTTCGCCAG CCCTATTCAC CAACAGATAA CCCTGTTAAG AACTTTAACT ATTACGAAGA TGTCAGCCAG AACCACGAAG ATTATCTGTG GGGTAATACG GCCTGGATGC TGGCATGCAA TATCGCAGAC AGTTTTGCCA AATACCGCTG GTGTCCAAAC ATTATCGGTC CGCAAAGCGG CGGCGCAGTG AAAGATCTGC CGGTGCATCT GTTCGAAACG ATGGGGCAGA TTCAGGCCAA AATCCCAACC GAGGTGCTGG TCACCGATCG CCGCGAATTT GAACTGGCTG AAGAGGGTTT TATCACCCTT ACCATGCGTA AAGACTCTGA TAATGCAGCC TTTTTCTCTG CAAACTCGGT ACAAAAACCG AAGCACTTTC CGGGAAAAGA TGCAGAAACC AACTATAAGC TGGGTACGCA GCTTCCTTAT CTCTTCATCA TCAATCGGTT AGCGCACTAC ATCAAGGTGT TGCAGCGTGA ACAACTGGGG TCATGGAAAG AAAGGAGCGA CTTAGAAAGA GAGCTTAATA CCTGGATCCG TCAGTATGTT GCCGACCAGG AAAACCCGCC TGCAGACGTG CGCAGCCGCA AACCTCTGCG CGCTGCCAAA GTTGAGGTTA TGGATGTAGA AGGCGAACCC GGCTGGTACC AGGTTGCGCT AAGCGTGAGG CCTCATTTTA AATTTATGGG GGCAAATTTT GAGCTTTCCC TGGTTGGCCG GTTAGACAGG GAGTAA
|
Protein sequence | MSLQEEELVS SHAGQPEQAS SLLDQIMAQT RIQPGSEGYD VARQGVTAFI ASILQSTASA EPVNKLAVDS MIADIDERIS RQMDVIIHAP AFQQVESFWR SLKTMVDRVD FRENIKVNVL HVTKQELLED FEFAPEIIQS GFYKHVYSSG FGQFGGEPIA AVLGAYEFKN TAPDMKLLQY VSAVGAMAHA PFLSSVSPEF MGLNSWTELP NIKDLYAIFE GPAYTKWRAL RDSEDSRYLG LTAPRFLLRQ PYSPTDNPVK NFNYYEDVSQ NHEDYLWGNT AWMLACNIAD SFAKYRWCPN IIGPQSGGAV KDLPVHLFET MGQIQAKIPT EVLVTDRREF ELAEEGFITL TMRKDSDNAA FFSANSVQKP KHFPGKDAET NYKLGTQLPY LFIINRLAHY IKVLQREQLG SWKERSDLER ELNTWIRQYV ADQENPPADV RSRKPLRAAK VEVMDVEGEP GWYQVALSVR PHFKFMGANF ELSLVGRLDR E
|
| |