Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A4029 |
Symbol | |
ID | 5591742 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 4020945 |
End bp | 4022270 |
Gene Length | 1326 bp |
Protein Length | 441 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 640923133 |
Product | hypothetical protein |
Protein accession | YP_001460599 |
Protein GI | 157163281 |
COG category | |
COG ID | |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 48 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATGGTC AAATAATCAC AGTTGCACTC ATCCTGTTGG GGCACATATT TCTGACGCCT GCTGTACAGG CCATAGGGTT CGACTATTAC AATGATCACG GTGTTATGTC TTACGGTAAA GGTTATGGGG AGGATGAGAA AATCATTGCC CAATTTCCGA AGATGAACAG AGCCGATTTG CGCCTTGTGA CAAATATATC CGGGGAGCGG GAGTTGGTGG AAGGCTATAT ACCCACCGAT GAAATAAGCA TAAAAAATGA GGAATACCAC TGGGTGACAG ACGGCCGCGT TATTCTCTGG CGTGGCAAAA TCGTTAGCAA TCCACCGGGA ACCCCCACTG TCGATATTGC CAGCTTTCAG GCTATGGGCC GCTTCGCGGT CGATAAATAT AGCCTCTATT TTGACGGACA GCGCACCGAA AGTAATAGCG GCGCGTCACG TGTTGATCTG GCGACACTAA AAGCTATCGA AGGTAACTCT ACCACGCTGA TGGATAGTAA AAATCTCTAC TTGTCCGGTC GACGTCAGGG GAGCAGCAGT GATGTTACTG TATTAGAAAA AAGATGGTGG GGTATTAATC CACGTCTTAT GAGTGTAAAC AGAAATTCGT ATTCCAATGA TCTGCTTATC CGCAGTGGGC AGAATATTTA TTTAAATGGC GTTCACCTTA CGGCGAATGC AGACTCATTT GAGATAATTC GCTGGATACC TCATTCACTG CTGGTTTTTC GCGACAATAA GGGTCTGCAT CGTTATCCCT TTGGTCAATT ATCAGGCAAA GCGATACCCG TAGATGATGA CGTCTCTTTT GAAGTAGGGG AAAGTCGCGT TCGCTGGCGT AAACAGCTCA CGCCCGACCG TCAGTGGAGC AAGTGGATAG ACCTACCAGG TATTGAACCT GAACAATTTC ATCTGATTAC TGGCAATATT GCGCAATATA AAGATCGGCT GTATGTAACA AAATTATCGA CATTTGGTGA AGACCAGCTT GAGATAATCC CGCTGGATAC GCCAGACCTG GTCATTGATC GCTCATTTAA TAGCGGCAAA CAGCATGCTT ACTTTATCCG CCAATTACGG TCAAAGAGCT TGCAAATTAT TCCAGTTAAC GGTCCGCTAA CTAAAAACGA TCGCTTCGCT TATGACGATC GCAATGTTTA TACATGGACC GATACAGAGG TAAGGATTAC GCCCTCCCCC TGCCCGGCGA AAACTCGTGT CAGAGAGGAG AACGTACGTG AAGTTCAAAA CAGAGACATC ATTATTCCGG TGACGGATGA ATCATGCCGG AACGCAGCAG CAGAGGTGCA AACTTTGAAG CCCTGA
|
Protein sequence | MNGQIITVAL ILLGHIFLTP AVQAIGFDYY NDHGVMSYGK GYGEDEKIIA QFPKMNRADL RLVTNISGER ELVEGYIPTD EISIKNEEYH WVTDGRVILW RGKIVSNPPG TPTVDIASFQ AMGRFAVDKY SLYFDGQRTE SNSGASRVDL ATLKAIEGNS TTLMDSKNLY LSGRRQGSSS DVTVLEKRWW GINPRLMSVN RNSYSNDLLI RSGQNIYLNG VHLTANADSF EIIRWIPHSL LVFRDNKGLH RYPFGQLSGK AIPVDDDVSF EVGESRVRWR KQLTPDRQWS KWIDLPGIEP EQFHLITGNI AQYKDRLYVT KLSTFGEDQL EIIPLDTPDL VIDRSFNSGK QHAYFIRQLR SKSLQIIPVN GPLTKNDRFA YDDRNVYTWT DTEVRITPSP CPAKTRVREE NVREVQNRDI IIPVTDESCR NAAAEVQTLK P
|
| |