Gene EcHS_A2123 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A2123 
Symbol 
ID5594586 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp2107746 
End bp2109359 
Gene Length1614 bp 
Protein Length537 aa 
Translation table11 
GC content54% 
IMG OID640921262 
ProductIS66 family transposase 
Protein accessionYP_001458801 
Protein GI157161483 
COG category[L] Replication, recombination and repair 
COG ID[COG3436] Transposase and inactivated derivatives 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones47 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTCAGA AATACCTCAT TCGCATCGCA GAGCTGGAAA GGTTGCTCTC TGAGCAGGCT 
GAAGCCCTCC GTCAGAAAGA CCAGCAACTG AGTCTGGTTG AAGAGACGGA AGCCTTCCTG
CGCTCTGCAC TGACACGTGC CGAAGAAAAG ATCGAAGAAG ATGAACGGGA AATAGAACAT
CTGCGGGCTC AGATAGAAAA ACTGCGCCGG ATGCTGTTCG GTACCCGTTC TGAAAAACTG
CGTCGTGAAG TTGAACTGGC TGAGGCTCTG CTGAAACAAC GTGAACAGGA CAGCGATCGT
TACAGTGGGC GGGAAGACGA TCCTCAGGTT CCCCGCCAGT TGCGACAGTC GCGCCATCGT
CGTCCGTTAC CGGCACACCT TCCCCGTGAA ATACACCGCC TGGAGCCAGA AGAAAGCTGT
TGCCCGGAGT GTGGCGGTGA GCTGGATTAT CTGGGGGAAG TCAGCGCTGA ACAGCTGGAA
CTGGTGAGCA GTGCCCTGAA AGTGATCCGC ACAGAACGGG TAAAAAAAGC CTGTACAAAA
TGTGACTGTA TTGTTGAAGC ACCGGCGCCG TCCCGCCCGA TAGAGCGTGG TATCGCGGGC
CCCGGATTAC TTGCCCGCGT GTTAACGGGA AAATACTGCG AACATCTGCC ACTGTATCGT
CAGAGTGAAA TCTTTGCCCG CCAGGGTGTC GAACTGAGCC GGGCCTTACT CTCCAACTGG
GTTGACGCGT GCTGCCAGTT AATGACACCG GTGAATGATG CCCTGTACCG TTATGTAATG
AACACCCGCA AGATTCACAC TGATGACACA CCGGTAAAGG TACTGGCACC GGGTCAGAAA
AAGGCGAAAA CAGGGCGTAT CTGGACGTAT GTCCGGGATG ATCGCAATGT GGGTTCGTCA
TCTCCTCCAG CGGTCTGGTT CGCGTACTCG CCGAACCGGC AGGGGAAACA CCCGGAGCAA
CACCTCCGCC CCTTCCGGGG TATCCTGCAG GCGGATGCGT TCACAGGTTA CGACAGGTTG
TTCAGTGCAG AACGTGAAGG TGGTGCACTG ACAGAAGTTG CGTGCTGGGC CCATGCCCGG
CGAAAAATCC ACGATGTATA CATCAGCAGC AAAAGTGCGA CGGCAGAAGA AGCACTGAAG
CGAATCAGTG AACTGTACGC CATCGAGGAT GAAATACGGG GATTACCGGA GTCAGAGCGT
CTTGCCGTCA GGCAGCAGCG AAGCAAAGTG TTACTGACGT CGCTGCATGA ATGGATGGTG
GAGAAGAATG GTACGCTGTC GAAAAAATCC AGACTGGGCG AAGCGTTCAG CTATGTACTG
AATCAGTGGG ATGCCCTCTG TTATTACAGT GATGACGGTC TGGCGGAGGC GGATAATAAT
GCTGCGGAAA GAGCGCTTCG TGCAGTCTGT CTCGGAAAGA AAAACTTTAT GTTCTTTGGC
AGCGATCACG GCGGCGAGCG TGGAGCACTG TTGTACGGGC TGATCGGCAC CTGCCGTCTG
AACGGTATCG ATCCGGAAGC GTATCTGCGC CATATCCTGA GCGTACTGCC GGAATGGCCT
TCCAACCGAG TTGATGAACT CCTGCCATGG AACGTAGTAC TCACCAATAA ATAA
 
Protein sequence
MSQKYLIRIA ELERLLSEQA EALRQKDQQL SLVEETEAFL RSALTRAEEK IEEDEREIEH 
LRAQIEKLRR MLFGTRSEKL RREVELAEAL LKQREQDSDR YSGREDDPQV PRQLRQSRHR
RPLPAHLPRE IHRLEPEESC CPECGGELDY LGEVSAEQLE LVSSALKVIR TERVKKACTK
CDCIVEAPAP SRPIERGIAG PGLLARVLTG KYCEHLPLYR QSEIFARQGV ELSRALLSNW
VDACCQLMTP VNDALYRYVM NTRKIHTDDT PVKVLAPGQK KAKTGRIWTY VRDDRNVGSS
SPPAVWFAYS PNRQGKHPEQ HLRPFRGILQ ADAFTGYDRL FSAEREGGAL TEVACWAHAR
RKIHDVYISS KSATAEEALK RISELYAIED EIRGLPESER LAVRQQRSKV LLTSLHEWMV
EKNGTLSKKS RLGEAFSYVL NQWDALCYYS DDGLAEADNN AAERALRAVC LGKKNFMFFG
SDHGGERGAL LYGLIGTCRL NGIDPEAYLR HILSVLPEWP SNRVDELLPW NVVLTNK