Gene Information |
Locus tag | EcHS_A2656 |
Symbol | |
ID | 5591888 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 2667104 |
End bp | 2668291 |
Gene Length | 1188 bp |
Protein Length | 395 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 640921771 |
Product | phage integrase family site specific recombinase |
Protein accession | YP_001459298 |
Protein GI | 157161980 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG4974] Site-specific recombinase XerD |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 0.000325439 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTGACTG ACACCAGGCT GCGTCACCTT AAGCCGAAGG AGAAACTCTA TAAAGTTAAT GACCGTGATG GTCTGTATGT TGCGGTCACT CCGGCTGGAA CGATCTCATT TCGTTATAAC TATTCAATAA ACGGAAGACA GGAGACCGTT ACTTTTGGCC GCTATGGTGT GGGAGGGATC ACGCTTGCAG AAGCGCGTGA ACGGCTCAAT GAAGCTAAAA AAATGGTTGC CGGTGGAAGA TCTCCAGCGA GGGAAAAAGC TAGAGATAAA GCGCGTATCA AAGATGCGGA GACTTTTGGT GCATGGGCTG AGAAATGGTT ACGCGGCTAT CAAATGGCTG AATCGACGCG TGATATGCGG CGTTCGGTAT ATCAAAGGGA GTTGAAGTCT AAATTTGCGC AGCAGAAATT GGCCGAGATT ACACATGAAG ACTTACGCGC ATTAACCGAT AACATTGTCG AGAGAGGGGC ACCGGCGACA GCTGTACACG CCAGAGAGAT TGTATTGCAA GTCTATCGCT GGGCTATTGA GCGCGGTCAG AAGGTTGAGA ATCCCGCTGA TCTGGTACGG CCTGCAAGCA TAGCGAAATT TGAGCCTCGT GACAGGGCAT TGACGCCAGT TGAAATTGGT CTGATGTATC GGTACATGGA ACGGGTAGGA ACGACGCCAT CAATCAGAGC AGCGGTTAAA CTTTTGCTGT TAACGATGGT ACGTAAAAGT GAGCTTACTA ACGCAACCTG GAACGAGATC AATTTTAGTG AAGCATTATG GACGATACCA AAGGAGAGGA TGAAAAGACG TAATCCACAT TTGGTTTTTC TTTCCAGACA GGCAATGGAT ATCATGATTG CTCTCAAAAC TTTTGCCGGT AGCTCTGATT TTATCCTTCC TTCGCGGTAC GATTCCGATG CGCCTATGAG TAGTGCTACT TTAAACCGGG TTTTGACCTT GACGTATCGC CTGGCACAGA AAGAAGGGGA GTCATTGCCG AAGTTTGGTC CTCATGACTT ACGGCGTACA GCCAGCACTT TGCTACATGA GGCCGGGTAC AATACTGACT GGATCGAAAA ATGCCTGGCA CATGAACAGA AAGGGGTTAG GGCGGTATAC AACAAAGCTG AATATCGAGA GCAAAGAGCA TCAATGCTAC AGGATTGGGC TGATATGATA GATGAATGGG TTGCGTAA
Protein sequence | MLTDTRLRHL KPKEKLYKVN DRDGLYVAVT PAGTISFRYN YSINGRQETV TFGRYGVGGI TLAEARERLN EAKKMVAGGR SPAREKARDK ARIKDAETFG AWAEKWLRGY QMAESTRDMR RSVYQRELKS KFAQQKLAEI THEDLRALTD NIVERGAPAT AVHAREIVLQ VYRWAIERGQ KVENPADLVR PASIAKFEPR DRALTPVEIG LMYRYMERVG TTPSIRAAVK LLLLTMVRKS ELTNATWNEI NFSEALWTIP KERMKRRNPH LVFLSRQAMD IMIALKTFAG SSDFILPSRY DSDAPMSSAT LNRVLTLTYR LAQKEGESLP KFGPHDLRRT ASTLLHEAGY NTDWIEKCLA HEQKGVRAVY NKAEYREQRA SMLQDWADMI DEWVA
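
The coordinate and length fields in this record can be cross-checked against each other. The sketch below is one way to do that. The function names are hypothetical and are not part of the source database. It assumes that Start bp and End bp are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based, inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, excluding the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """GC percentage of a nucleotide string; spaces between blocks are ignored."""
    bases = seq.replace(" ", "").upper()
    if not bases:
        raise ValueError("empty sequence")
    return 100.0 * sum(b in "GC" for b in bases) / len(bases)


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(2667104, 2668291)
print(length)                  # 1188
print(protein_length(length))  # 395
```

Passing the Gene sequence field to `gc_percent` would give a value to compare with the reported GC content; no result is asserted here.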