Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A0225 |
Symbol | |
ID | 5591431 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 239808 |
End bp | 241160 |
Gene Length | 1353 bp |
Protein Length | 450 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 640919412 |
Product | ImpA domain-containing protein |
Protein accession | YP_001456999 |
Protein GI | 157159681 |
COG category | [S] Function unknown |
COG ID | [COG3515] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 37 |
Plasmid unclonability p-value | 0.347157 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATAGTA ACGTACTGAC ACAAACTATC GTTACCGGCA GTGACCCGCG CGGGCTGCCG GAATTCAGTG CCATCCGCGA GGAAATAAAC AAAGCCAGCC ACCCGTCACA GCCTGAGCTG AACTGGAAAC TGGTGGAGTC GCTGGCACTG GCGATTTTTA AAGCCAACGG TGTGGATTTA CACACCGCCA CCTACTATAC GCTTGCCCGG ACACGGACAC AGGGACTGGC GGGATTCTGC GAAGGTGCGG AACTGCTGGC GGCAATGGTA AGCCACGACT GGGATAAGTT CTGGCCGCAG GGCGGCCCGG CGCGTACTGA AATGCTGGAC TGGTTTAACT CCCGAACCGG CAATATTCTG CGTCAGCAAA TCTCCTTTGC GGAATCCGAC CTACCACTGA TCTACCGCAC AGAGCGGGCA TTGCAGCTTA TCTGCGATAA GCTCCAGCAG GTGGAACTGA AGCGCGTTCC GCGCGTGGAG AATCTGCTCT ATTTTATGCA GAACACGCGT AAACGGCTTG AACCCCAGCT GAAGAGTAAC ACTGAGAACG CCGCACAGAC CACGGTCAGA ACGCTGATTT ATGCCCCGGA AACACAGGCA TCTTCCACAC CAGAAGCGGT AGTGCCTCCC CTGCCCGGCC TGCCTGAGAT GAAAGTGGAA GTGCGCAGTC TGACAGAGAA TCCCCCACAG GCCAGTGTGA TAAAGCAAGG CAGTACGGTA AGAGGGTTTA TCGCAGGGAT CGCCTGTTCA GTGGCTGTCG CCTCAGCATT GTGGTGGTGG CAGGTCTATC CGGTACAGCA GCAACTGTTA CAGGTTAACG ACACCGCTCA GGGCGCAGCA ACGGTGTGGA TGGCCTCACC TGAACTCGAA AACTATGAGC GCAGGCTGCA ACAACTTCTT GATACCTCCC CGGTACAGCC GCTGGAAACC GGGATGCAGA TGATGCGTGT TGCCGACAGT CGCTGGCCGG AAAGCCTGCA ACAGCAACAG GCCTCGACAC AATGGAATGA GGCACTCAAA ACCCGCGCAC AGAGTAGCCC GCAGTTGCGT GGCTGGTTGC AGACCCGCCA GGACTTACAT GCTTTTGCAG ATCTAGTGAT GCAGCGCGAG AAAGAGGGAC TAACCCTTTC CTATATCAAA AATGTCATCT GGCAGGCGGA GCGGGGACTG GGGCAGGAAA CACCCGTTGA GTCTCTGTTG ACGCAGTACC AGGATGCCCG TGCGCAGAAG CAGAATACAG ATGCGCTGGA AAAACAAATT AATGAGCGAC TCGAAGGCGT GTTAAGCCGC TGGCTGCTGC TGAAGAATAA CGTCATGCCA GAGGCGGCAA CCGGTACCAC GGCCGAAAAA TAA
|
Protein sequence | MNSNVLTQTI VTGSDPRGLP EFSAIREEIN KASHPSQPEL NWKLVESLAL AIFKANGVDL HTATYYTLAR TRTQGLAGFC EGAELLAAMV SHDWDKFWPQ GGPARTEMLD WFNSRTGNIL RQQISFAESD LPLIYRTERA LQLICDKLQQ VELKRVPRVE NLLYFMQNTR KRLEPQLKSN TENAAQTTVR TLIYAPETQA SSTPEAVVPP LPGLPEMKVE VRSLTENPPQ ASVIKQGSTV RGFIAGIACS VAVASALWWW QVYPVQQQLL QVNDTAQGAA TVWMASPELE NYERRLQQLL DTSPVQPLET GMQMMRVADS RWPESLQQQQ ASTQWNEALK TRAQSSPQLR GWLQTRQDLH AFADLVMQRE KEGLTLSYIK NVIWQAERGL GQETPVESLL TQYQDARAQK QNTDALEKQI NERLEGVLSR WLLLKNNVMP EAATGTTAEK
|
| |