Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A2123 |
Symbol | |
ID | 5594586 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | + |
Start bp | 2107746 |
End bp | 2109359 |
Gene Length | 1614 bp |
Protein Length | 537 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 640921262 |
Product | IS66 family transposase |
Protein accession | YP_001458801 |
Protein GI | 157161483 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3436] Transposase and inactivated derivatives |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 47 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTCAGA AATACCTCAT TCGCATCGCA GAGCTGGAAA GGTTGCTCTC TGAGCAGGCT GAAGCCCTCC GTCAGAAAGA CCAGCAACTG AGTCTGGTTG AAGAGACGGA AGCCTTCCTG CGCTCTGCAC TGACACGTGC CGAAGAAAAG ATCGAAGAAG ATGAACGGGA AATAGAACAT CTGCGGGCTC AGATAGAAAA ACTGCGCCGG ATGCTGTTCG GTACCCGTTC TGAAAAACTG CGTCGTGAAG TTGAACTGGC TGAGGCTCTG CTGAAACAAC GTGAACAGGA CAGCGATCGT TACAGTGGGC GGGAAGACGA TCCTCAGGTT CCCCGCCAGT TGCGACAGTC GCGCCATCGT CGTCCGTTAC CGGCACACCT TCCCCGTGAA ATACACCGCC TGGAGCCAGA AGAAAGCTGT TGCCCGGAGT GTGGCGGTGA GCTGGATTAT CTGGGGGAAG TCAGCGCTGA ACAGCTGGAA CTGGTGAGCA GTGCCCTGAA AGTGATCCGC ACAGAACGGG TAAAAAAAGC CTGTACAAAA TGTGACTGTA TTGTTGAAGC ACCGGCGCCG TCCCGCCCGA TAGAGCGTGG TATCGCGGGC CCCGGATTAC TTGCCCGCGT GTTAACGGGA AAATACTGCG AACATCTGCC ACTGTATCGT CAGAGTGAAA TCTTTGCCCG CCAGGGTGTC GAACTGAGCC GGGCCTTACT CTCCAACTGG GTTGACGCGT GCTGCCAGTT AATGACACCG GTGAATGATG CCCTGTACCG TTATGTAATG AACACCCGCA AGATTCACAC TGATGACACA CCGGTAAAGG TACTGGCACC GGGTCAGAAA AAGGCGAAAA CAGGGCGTAT CTGGACGTAT GTCCGGGATG ATCGCAATGT GGGTTCGTCA TCTCCTCCAG CGGTCTGGTT CGCGTACTCG CCGAACCGGC AGGGGAAACA CCCGGAGCAA CACCTCCGCC CCTTCCGGGG TATCCTGCAG GCGGATGCGT TCACAGGTTA CGACAGGTTG TTCAGTGCAG AACGTGAAGG TGGTGCACTG ACAGAAGTTG CGTGCTGGGC CCATGCCCGG CGAAAAATCC ACGATGTATA CATCAGCAGC AAAAGTGCGA CGGCAGAAGA AGCACTGAAG CGAATCAGTG AACTGTACGC CATCGAGGAT GAAATACGGG GATTACCGGA GTCAGAGCGT CTTGCCGTCA GGCAGCAGCG AAGCAAAGTG TTACTGACGT CGCTGCATGA ATGGATGGTG GAGAAGAATG GTACGCTGTC GAAAAAATCC AGACTGGGCG AAGCGTTCAG CTATGTACTG AATCAGTGGG ATGCCCTCTG TTATTACAGT GATGACGGTC TGGCGGAGGC GGATAATAAT GCTGCGGAAA GAGCGCTTCG TGCAGTCTGT CTCGGAAAGA AAAACTTTAT GTTCTTTGGC AGCGATCACG GCGGCGAGCG TGGAGCACTG TTGTACGGGC TGATCGGCAC CTGCCGTCTG AACGGTATCG ATCCGGAAGC GTATCTGCGC CATATCCTGA GCGTACTGCC GGAATGGCCT TCCAACCGAG TTGATGAACT CCTGCCATGG AACGTAGTAC TCACCAATAA ATAA
|
Protein sequence | MSQKYLIRIA ELERLLSEQA EALRQKDQQL SLVEETEAFL RSALTRAEEK IEEDEREIEH LRAQIEKLRR MLFGTRSEKL RREVELAEAL LKQREQDSDR YSGREDDPQV PRQLRQSRHR RPLPAHLPRE IHRLEPEESC CPECGGELDY LGEVSAEQLE LVSSALKVIR TERVKKACTK CDCIVEAPAP SRPIERGIAG PGLLARVLTG KYCEHLPLYR QSEIFARQGV ELSRALLSNW VDACCQLMTP VNDALYRYVM NTRKIHTDDT PVKVLAPGQK KAKTGRIWTY VRDDRNVGSS SPPAVWFAYS PNRQGKHPEQ HLRPFRGILQ ADAFTGYDRL FSAEREGGAL TEVACWAHAR RKIHDVYISS KSATAEEALK RISELYAIED EIRGLPESER LAVRQQRSKV LLTSLHEWMV EKNGTLSKKS RLGEAFSYVL NQWDALCYYS DDGLAEADNN AAERALRAVC LGKKNFMFFG SDHGGERGAL LYGLIGTCRL NGIDPEAYLR HILSVLPEWP SNRVDELLPW NVVLTNK
|
| |