Gene Information |
Locus tag | EcHS_A0329 |
Symbol | |
ID | 5590906 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | + |
Start bp | 333837 |
End bp | 335045 |
Gene Length | 1209 bp |
Protein Length | 402 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 640919515 |
Product | site-specific recombinase, phage integrase family protein |
Protein accession | YP_001457101 |
Protein GI | 157159783 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0582] Integrase |
TIGRFAM ID | |
| |
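The coordinate and length fields above can be cross-checked against each other. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it agrees with the stated gene length) and that the protein length excludes the terminal stop codon. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 333837
end_bp = 335045

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A CDS of this length holds gene_length // 3 codons; the last one is the
# stop codon, which does not contribute a residue to the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1209
print(protein_length)  # 402
```

Both values match the Gene Length (1209 bp) and Protein Length (402 aa) fields.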
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 0.0019463 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACTTA ATGCGCGACA GGTAGACGCA GCTAAACCCA GAGAGAAAGC CTATAAGCTG GCAGATGGTG CTGGTTTGTA TCTTGAGGTT GTTCCCTCTG GTTCCAGATA CTGGCGGATG AAATATCGCT TCAATGGAAA AGAGAAGCGT ATGGCCTTTG GTGTCTATCC GGCAGTGTCC CTTGCACAAG CGAGGGCACT GCGTGACGAC GCCAAGAAAA AGCTTGCCGA AGGTATCGAT CCATCGCTTG CCAAGAAAGA AGAAAAGCTG GTTCGAGATG TGCAGCTAAA TAATACGTTT CAGGCTGTGG CACTTGAATG GCATGGAACG AAGGTGAGCC GATGGTCAGA AGGTTATGCC TCGGACATTA TCGAAGCCTT TAATAAAGAT ATTTTTCCCT ATATTGGCCA ACAACCGGTG AATGAAATCA AACCGCTGGT TCTGCTTAAT GTGCTGCGTC GAATTGAAAG CCGTGGCGCG ACAGAGAAGG CCAAGAAGGT TCGCCAGCGT TGCAGCGAAG TCTTTCGTTA CGCCATCGTA ACCGGTCGTG CGGAATACAA TCCTGCAGCG GATCTTACCA GCGCGATGTC AGGGCATGAA TCGAAGCATT ATCCCTTCCT TACTGTTGAG GAGTTACCAG ACTTTTTTAA AGCTCTCGCA GGCTACACAG GAAGCCCGTT AGTTGTTCTT GCGGCACGTC TGCTGATCCT CACGGGAGTT CGCACTGGCG AACTCCGAGG TGCTTTCTGG AGTGAGTTTG ATCTTGAAAA AGCGGTGTGG GAGATACCTG CCGAGCGTAT GAAGATGAAA CGGCCTCACC TTGTCCCCCT CTCTACCCAA GCGCTGGAAA TCGTACAGCA ACTCAAAGTG ATGTCAGGGC AATATCCACT GGTGTTCCCG GGACGTAATG ATCCCCGCAA GACGATGAGT GAAGCGAGTA TTAATCAGGT TTTTAAGCGG ATTGGATATA CGGGGAGGGT AACGGGGCAT GGTTTCCGTC ACACGATGAG TACGATTTTG CATGAGGAAG GTTTCAATAC AGCGTGGATT GAAACCCAGC TGGCTCACGT TGATAAAAAT GCGATTCGTG GGACGTATAA CCATGCGTTG TATTTGGAAG GACGTAGGGA GATGATGCAG TGGTATGCGG ACTATATAAA CGCTAATCAT AATTCATGTA TTTCTTTGCT GGTGAACGCT ACAAAGTAG
|
Protein sequence | MKLNARQVDA AKPREKAYKL ADGAGLYLEV VPSGSRYWRM KYRFNGKEKR MAFGVYPAVS LAQARALRDD AKKKLAEGID PSLAKKEEKL VRDVQLNNTF QAVALEWHGT KVSRWSEGYA SDIIEAFNKD IFPYIGQQPV NEIKPLVLLN VLRRIESRGA TEKAKKVRQR CSEVFRYAIV TGRAEYNPAA DLTSAMSGHE SKHYPFLTVE ELPDFFKALA GYTGSPLVVL AARLLILTGV RTGELRGAFW SEFDLEKAVW EIPAERMKMK RPHLVPLSTQ ALEIVQQLKV MSGQYPLVFP GRNDPRKTMS EASINQVFKR IGYTGRVTGH GFRHTMSTIL HEEGFNTAWI ETQLAHVDKN AIRGTYNHAL YLEGRREMMQ WYADYINANH NSCISLLVNA TK
|
| |
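The sequence fields are printed in blocks of ten characters separated by spaces. A minimal sketch of how such a string might be cleaned, its GC fraction computed, and its reading frame translated is shown below. The function names are hypothetical. The codon table is built from the amino acid string that NCBI lists for translation table 11; alternative start codons, which table 11 permits, are not handled here, which is harmless for this gene because it begins with ATG.

```python
from itertools import product

# Codon order follows the NCBI convention: bases T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between ten-character blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases (0.0 for an empty sequence)."""
    s = clean(seq)
    if not s:
        return 0.0
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence shown above.
prefix = "ATGAAACTTA ATGCGCGACA GGTAGACGCA"
print(translate(prefix))  # MKLNARQVDA
```

Passing the full Gene sequence field to `translate` and `gc_fraction` in the same way would allow a comparison with the Protein sequence and GC content fields.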