Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A0272 |
Symbol | |
ID | 5594619 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 291831 |
End bp | 292991 |
Gene Length | 1161 bp |
Protein Length | 386 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 640919458 |
Product | phage integrase family site specific recombinase |
Protein accession | YP_001457044 |
Protein GI | 157159726 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG4974] Site-specific recombinase XerD |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 53 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCAATCA AACTACGCGG TGGCACGTGG CACTGCGATT TCGTCGCGCC AGATGGATCA AGAGTTAGAC GCTCTCTTGA AACATCGGAC AAAAGGCAAG CGCAAGAACT TCACGATCGT CTGAAAGCAG AAGCGTGGAG AGTAAAAAAT CTCGGGGAAT CACCGAAAAA GCTATTCAAG GAAGCCTGCA TACGGTGGCT GCGTGAGAAA TCGGATAAGA AGTCCATTGA TGATGACAAG AGCATTATAT CGTTCTGGAT GTTGCACTTC AGAGAAACCA TTCTCTCAGA CATAACAACA GAAAAAATAA TGGAGGCGGT AGACGGGATG GAAAACCGCC GCCATCGCCT GAACTGGGAA ATGAGCCGGG ACAGGTGTTT GCGGCTTGGC AAGCCAGTGC CGGAGTATAA ACCAAAGCTG GCAAGCAAAG GAACGAAGAC GCGGCATCTG GCAATACTTC GCGCTATTCT CAATATGGCT GTTGAATGGG GATGGCTTGA CAGGGCGCCC AAAATATCAA CACCACGCGT TAAGAATGGA CGAATCAGAT GGCTTACAGA GGAGGAATCG AAGCGCCTGT TTGCAGAAAT TGCTCCTCAT TTCTTCCCTG TGGTCATGTT TGCAATCACG ACAGGCCTTC GCCGTTCCAA CGTTACAGAC CTTGAGTGGT CACAGGTCGA TCTGGATAAG AAAATGGCAT GGATGCACCC TGATGAAACA AAAGCTGGCA ATGCGATCGG AGTTCCTCTT AACGAAACCG CATGCCAGAT ATTAAGAAAA CAGCAGGGTC TCCATAAGAG ATGGGTGTTT GTCCACACCA AACCTGCCTA CCGAAGCGAC GGAACAAAAA CAGCAGCGGT AAGGAAGATG AGAACCGACA GCAACAAGGC ATGGAAGGGA GCGTTAAAGC GGGCAGGCAT TAGCAACTTC CGCTTCCATG ACCTGAGGCA TACCTGGGCA AGCTGGCTGG TTCAGTCTGG TGTCTCTCTT CTTGCACTTA AAGAGATGGG AGGATGGGAA ACTCTCGAAA TGGTTCAAAG ATACGCCCAC CTTTCAGCCG GGCATCTCAC CGAGCACGCA AGCAAAATCG ATGCGATTAT AAGTCGCAAT GGCACAAATA CGGCACAAGA GGAGAACGTG GTTTACTTAA ATGCGAGGTA A
|
Protein sequence | MSIKLRGGTW HCDFVAPDGS RVRRSLETSD KRQAQELHDR LKAEAWRVKN LGESPKKLFK EACIRWLREK SDKKSIDDDK SIISFWMLHF RETILSDITT EKIMEAVDGM ENRRHRLNWE MSRDRCLRLG KPVPEYKPKL ASKGTKTRHL AILRAILNMA VEWGWLDRAP KISTPRVKNG RIRWLTEEES KRLFAEIAPH FFPVVMFAIT TGLRRSNVTD LEWSQVDLDK KMAWMHPDET KAGNAIGVPL NETACQILRK QQGLHKRWVF VHTKPAYRSD GTKTAAVRKM RTDSNKAWKG ALKRAGISNF RFHDLRHTWA SWLVQSGVSL LALKEMGGWE TLEMVQRYAH LSAGHLTEHA SKIDAIISRN GTNTAQEENV VYLNAR
|
| |