Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A1924 |
Symbol | |
ID | 5592628 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | + |
Start bp | 1935235 |
End bp | 1936518 |
Gene Length | 1284 bp |
Protein Length | 427 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 640921067 |
Product | PqiA family integral membrane protein |
Protein accession | YP_001458618 |
Protein GI | 157161300 |
COG category | [S] Function unknown |
COG ID | [COG2995] Uncharacterized paraquat-inducible protein A |
TIGRFAM ID | [TIGR00155] integral membrane protein, PqiA family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.0000000171721 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTCTTA ACACACCACA AATTACGCCG ACAAAAAAGA TAACAGTGAG GTCAATCGGC GAGGAACTGC CGCGTGGTGA TTACCAACGT TGCCCGCAAT GTGACATGCT GTTTAGCCTG CCCGAGATAA ATTCTCATCA GAGTGCCTAT TGTCCGCGCT GTCAGGCAAA AATTCGCGAC GGGCGCGACT GGTCGCTAAC GCGCCTGGCG GCAATGGCTT TCACCATGCT GTTGCTGATG CCGTTTGCCT GGGGCGAACC GCTGTTGCAT ATCTGGCTGT TAGGCATTCG TATTGACGCC AACGTTATGC AAGGCATCTG GCAAATGACC AAACAGGGCG ATGCGATAAC GGGGTCGATG GTCTTTTTCT GCGTTATCGG TGCCCCCCTC ATTCTGGTGA CCTCCATAGC TTATTTATGG TTTGGTAACC GACTGGGAAT GAATCTACGC CCGGTACTGC TGATGCTTGA GCGACTTAAA GAGTGGGTAA TGCTGGATAT CTACCTGGTC GGCATTGGCG TTGCTTCTAT AAAGGTACAG GATTATGCCC ATATCCAGGC GGGTGTGGGC TTGTTCTCTT TTGTGGCGTT GGTGATTTTA ACGACGGTGA CGTTGTCACA TCTTAATGTC GAGGAGCTGT GGGAGCGATT TTATCCGCAG CGCCCCGCTA CGCGTAGGGA CGAGAAACTC CGTGTCTGTC TTGGGTGCCA TTTTACCGGC TATCCTGATC AGCGTGGTCG CTGCCCGCGT TGCCATATCC CGCTACGCGT GCGTCGCCGT CATAGTTTGC AAAAATGCTG GGCGGCGCTG TTAGCGTCAA TCGTTTTGTT GTTACCTGCC AACCTGTTGC CTATTTCTAT CATTTATCTG AACGGAGGAC GGCAGGAAGA TACAATTCTT TCCGGAATTA TGTCGCTGGC AAGTAGCAAC ATTGCGGTTG CGGGAATCGT GTTTATCGCC AGTATTCTGG TACCGTTTAC TAAAGTGATC GTCATGTTCA CTTTACTGTT GAGCATTCAT TTTAAATGCC AGCAAGGTTT ACGCACACGC ATTCTGTTAC TGCGGATGGT GACCTGGATT GGTCGCTGGT CGATGCTCGA CCTGTTTGTC ATATCTTTAA CCATGTCGCT GATTAATCGC GATCAGATCC TCGCTTTTAC TATGGGACCG GCTGCGTTTT ATTTCGGCGC AGCGGTAATT TTGACTATTC TTGCTGTGGA ATGGCTGGAC AGCCGCTTAC TTTGGGATGC ACATGAGTCA GGAAACGCCC GCTTCGACGA CTGA
|
Protein sequence | MALNTPQITP TKKITVRSIG EELPRGDYQR CPQCDMLFSL PEINSHQSAY CPRCQAKIRD GRDWSLTRLA AMAFTMLLLM PFAWGEPLLH IWLLGIRIDA NVMQGIWQMT KQGDAITGSM VFFCVIGAPL ILVTSIAYLW FGNRLGMNLR PVLLMLERLK EWVMLDIYLV GIGVASIKVQ DYAHIQAGVG LFSFVALVIL TTVTLSHLNV EELWERFYPQ RPATRRDEKL RVCLGCHFTG YPDQRGRCPR CHIPLRVRRR HSLQKCWAAL LASIVLLLPA NLLPISIIYL NGGRQEDTIL SGIMSLASSN IAVAGIVFIA SILVPFTKVI VMFTLLLSIH FKCQQGLRTR ILLLRMVTWI GRWSMLDLFV ISLTMSLINR DQILAFTMGP AAFYFGAAVI LTILAVEWLD SRLLWDAHES GNARFDD
|
| |