Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A4527 |
Symbol | |
ID | 5592199 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | + |
Start bp | 4532707 |
End bp | 4534656 |
Gene Length | 1950 bp |
Protein Length | 649 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 640923623 |
Product | phage integrase family site specific recombinase |
Protein accession | YP_001461063 |
Protein GI | 157163745 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 0.259982 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCCAGC GTTATAACCT CTATCGTCGA ACCAGCGGCA TTTATGTTGT CCGCATTAGC GTTCCCCAAC GTTTTCGTCG ATACGCAGGA CAGTGTGAGA TTCACACTTC AACAGGCACT CATGATCTTC ATGAAGCAAA GCAGAAGTCC GCGCTCCTGT TGGCTGTCTG GTATCAGACC TTACAAGAGT ATGAACAATT GGATTACCGA ACTTTAAGTG ACTGCGCCCC GCTGCTTGCT GGTGAGGGGA TGATCTCGCT TTCTAACTTT GCTCAGTCAA TCGAGTTGCC TATATCGCAA TTGATTCGAG AGGTGATTAA TCGTAACCTC CCGGTATTCT GGCTGGCGAC TGGTCAGTTC GGTTTCTATG TTGATGAATT TAATGCAGTA GAGCGGGAAC CCGGTGCAAA ACGAGAAAAA CAGTCTGATG ATGAAAAGGA TCAACCTAAA GAAGTCATCA TTCTCAATAG CGCGTTTGAG CTGGGTATCG AGAGCTTCGC AAATGGTTAT CTCCGCCCCT TCAATCCCCG GCATACTTTA GATTGTCTGT TGAGCGCTGG AGTATCCGAA GGAGAGGCTG CATTTCGAAC TAGTGGTGAT AACCAAAGTG GAGGTTGGTT CTTCGATTTA CCCGGCGTAG ATATAACTGC TGATAGCCTC TTGATTAGCA AAGTTCATGC TGAAGGCCTT CGACTTACAT GGCTGGTTAA GACCACGCCA CCAGCAGTTA GCATTCACCC TGCCGTGCCT CTTGTCGCCC CTGTTATCGC TAACGAATAT GTTCACCGCA AACATTACAA TGAAAACTTG TCATGGCTTC GTGAAGAGTA TTTGAAACAT CGGCGTAAGG GCAAGGTATC AGAAGCGGCG CTCCGCGATA TTCGCTATTA CTTCGATTTG ATGATCGAAG TGATGGGGGA TATTCAGTTG GAAGATTTCG ACCGTGATTT CCTCCGGGCT TATGAGAGCA AGTTGCGCAC AATTCCTGCT AACCGTAATT TGATGAAAGG TAAGCACGGG GTTAAGACGC TGGATGAGTT AATCGCCAAA GCGGCAGAAT GTGGCGATAA ACTGATGACA GAAGAGTCTG TCAAAAAGTA TATCAACGGC CTTTATGGTG CAATGGAGTG GGCTGTTGAT GACGGTAAGT TTCTGAAATC GCCATGCGAC AACTTTTTCC CTCCCGATGA CAAAGGTGAG CGAGAGCAGG ATCACACTGA CATATTTGAA CCGCATGAAA TTAAGGCAAT TTTTTCGCAA CCGTGGTTTG TGGCTGGAAC TGTTGAACGT AATGCGCAAG GGCGATTCCA TCAATATTGC CCGTTTCACT ATTGGGCGCC GTTGTTGGGC TTGATGACGG GGGCAAGGGT TAACGAGATT GCACAGTTAA TGCTGGACGA TGTTCTGGCA GATGACGGCG TTTATTACCT GAACCTTGAA AGCGATAGCG AAAACGGAAA GAAACTAAAA AACGCCCAAT CCCGCCGCAA GATTCCGGTT CATTCTACGC TGATTGAACT CGGTTTTATC GAGTATGTGG ATGCGTTGAA AGCTGCCGGG TATGACCGTC TTTTTCCCGA GCTTAAACCA CATAAAACTA AAGGCTATGG TAGGCCGGTT TCCGCATGGT TCAATGAATC ATTGCTTGCG GGTCGATTAA AACTTGAAAG AGACAGAAGC AAATCTTTCC ACTCTTTCCG GCATTCTGTT TCAACTTTGC TTAAAGAGAA GGGTGTTAGT TCGGAACTGC GTGGGCAGCT ACTTGGGCAT GTGCGCGGCA AAACAGAAAC TGAAGTGCGA TACAGCAAAG ATTTAAAACC GGTTCACATG GTTGAGGTTG TCGAAAAGAT TGATTTTTCT TTGCCCGAGA TAGCGAGATT CAACATTCCT GATGGGCTGG ATGCTGTAAG TGATGCGCTG CGAAGAAAGC GTGGCAAACA AACAGGTTGA
|
Protein sequence | MSQRYNLYRR TSGIYVVRIS VPQRFRRYAG QCEIHTSTGT HDLHEAKQKS ALLLAVWYQT LQEYEQLDYR TLSDCAPLLA GEGMISLSNF AQSIELPISQ LIREVINRNL PVFWLATGQF GFYVDEFNAV EREPGAKREK QSDDEKDQPK EVIILNSAFE LGIESFANGY LRPFNPRHTL DCLLSAGVSE GEAAFRTSGD NQSGGWFFDL PGVDITADSL LISKVHAEGL RLTWLVKTTP PAVSIHPAVP LVAPVIANEY VHRKHYNENL SWLREEYLKH RRKGKVSEAA LRDIRYYFDL MIEVMGDIQL EDFDRDFLRA YESKLRTIPA NRNLMKGKHG VKTLDELIAK AAECGDKLMT EESVKKYING LYGAMEWAVD DGKFLKSPCD NFFPPDDKGE REQDHTDIFE PHEIKAIFSQ PWFVAGTVER NAQGRFHQYC PFHYWAPLLG LMTGARVNEI AQLMLDDVLA DDGVYYLNLE SDSENGKKLK NAQSRRKIPV HSTLIELGFI EYVDALKAAG YDRLFPELKP HKTKGYGRPV SAWFNESLLA GRLKLERDRS KSFHSFRHSV STLLKEKGVS SELRGQLLGH VRGKTETEVR YSKDLKPVHM VEVVEKIDFS LPEIARFNIP DGLDAVSDAL RRKRGKQTG
|
| |