Gene Information |
Locus tag | EcolC_1063 |
Symbol | |
ID | 6066036 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | - |
Start bp | 1155680 |
End bp | 1156876 |
Gene Length | 1197 bp |
Protein Length | 398 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 641600475 |
Product | integrase family protein |
Protein accession | YP_001724057 |
Protein GI | 170019103 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0582] Integrase |
TIGRFAM ID | |
|
|
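The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 1155680
end_bp = 1156876
gene_length_bp = 1197
protein_length_aa = 398

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS contains exactly one terminal stop codon,
# which does not contribute an amino acid to the protein.
codon_count = span_bp // 3
expected_protein_aa = codon_count - 1

print(span_bp, span_bp % 3, expected_protein_aa)  # prints: 1197 0 398
```

Under these assumptions the span matches the listed gene length, is a whole number of codons, and gives the listed protein length.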
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGGCAAGAA CGACACGCCC CCTGACCAAC ACAGAAGTAC TGCGCGCTAA AGCGTTAGAA AAGGATCTAA CGTTGCATGA TGGCGATGGT CTTTTTCTAC TCGTTAAAAC GAACGGTAAG AAGTTATGGC GTTTCCGTTA TCAACGTCCG GCAACAAAGC AACGGACAAT GATGGGGCTA GGAGCCTTCC CAGCCCTTTC ACTTGCTGAC GCCCGACGCT TAAGAGCGGA TTACCTTTCC TTGTTAGCCA ACGGAATTGA CCCGCAAATT CAAGCTGAAA TTGCAGAGGA ACAGCAGCAA ATCGCACAGG ACAGTATTTT CTCGACGGTC GCCGCTAATT GGTTTCAGCT CAAAAGCAAA AGTGTTACCC CTGATTATGC AAAAGATATT TGGCGCTCAT TGGAAAAAGA TGTATTCCCC GCCGTTGGTG AGATGCCCGT TCAGCAGATC AAAGCTAGAA CATTGGTCGA AGCACTTGAG CCAGTCAAAG CTCGTGGGGC ATTAGAGACT GTACGTCGTC TGGTGCAACG CATTAACGAA ATAATGATTT ATGCGGTTAA CACTGGCTTG ATTGATGCAA ACCCAGCATC AGGTGTTGGC ATGGCCTTCG AAAAGCCAAA AAAACAAAAC ATGCCGACGC TTCGACCTGA AGAATTACCA AAGCTGATGC GTTCTTTAGT CATGTCAAAT CTGTCTATCC CGACTCGCTG TCTAATTGAA TGGCAACTCC TGACTCTTGT GCGCCCTTCT GAAGCCTCCA GTACTCGGTG GGAAGAAATC GATCTTCATG CAAAGCTCTG GACGATTCCT GCCGAACGGA TGAAGGCTAA ACGGGAACAC ATAATTCCTC TATCATCTCA GGCATTAGAG ATTCTTAATG TGATGAAGCC TATTAGTGCT CATCGTGAAT ATGTTTTTCC GAGTCGGAAT GACCCAAAGA AACCAATGAA CAGTCAGACT GCAAATGCAG CTTTAAAACG TATTGGTTTT GGCGGAAAAT TAGTTGCCCA TGGATTACGT TCAATAGCAA GTACAGCCAT GAATGAAGCT GGATTAAATC CTGATGTTAT CGAGTCTGCC TTAGCCCACA GTGATAAAAA TGAAGTTAGA AAAGCATACA ATCGTTCTAC TTATCTCGTG CAGCGAATTG AATTGATGGA TTGGTGGGGA GAATACGTTA AAAATAAAAG GGGTTAA
|
Protein sequence | MARTTRPLTN TEVLRAKALE KDLTLHDGDG LFLLVKTNGK KLWRFRYQRP ATKQRTMMGL GAFPALSLAD ARRLRADYLS LLANGIDPQI QAEIAEEQQQ IAQDSIFSTV AANWFQLKSK SVTPDYAKDI WRSLEKDVFP AVGEMPVQQI KARTLVEALE PVKARGALET VRRLVQRINE IMIYAVNTGL IDANPASGVG MAFEKPKKQN MPTLRPEELP KLMRSLVMSN LSIPTRCLIE WQLLTLVRPS EASSTRWEEI DLHAKLWTIP AERMKAKREH IIPLSSQALE ILNVMKPISA HREYVFPSRN DPKKPMNSQT ANAALKRIGF GGKLVAHGLR SIASTAMNEA GLNPDVIESA LAHSDKNEVR KAYNRSTYLV QRIELMDWWG EYVKNKRG
|
| |
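The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation sketch. This is an illustrative example under stated assumptions, not the method used to generate the record. It assumes the gene sequence is given 5' to 3' on the coding strand (it begins with TTG and ends with TAA), and that the first codon is an initiator codon read as methionine; TTG is one of the alternative start codons in translation table 11. The sense-codon assignments of table 11 are the same as in the standard code. The function names `translate_cds` and `gc_fraction` are hypothetical helpers introduced here.

```python
from itertools import product

# Codon-to-amino-acid map in NCBI ordering (bases T, C, A, G).
# Translation table 11 uses the same codon assignments as the standard code;
# it differs in which codons may act as start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(dna: str) -> str:
    """Remove the spaces that group the sequence into blocks of 10 and uppercase it."""
    return "".join(dna.split()).upper()


def translate_cds(dna: str) -> str:
    """Translate a coding-strand CDS, stopping at the first stop codon.

    Assumption: the first codon is a valid initiator and is read as Met,
    even when it is an alternative start such as TTG.
    """
    seq = clean(dna)
    usable = len(seq) - len(seq) % 3  # ignore any trailing partial codon
    protein = []
    for i in range(0, usable, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append("M" if i == 0 else aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(dna)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases of the gene sequence listed above.
prefix = "TTGGCAAGAA CGACACGCCC CCTGACCAAC"
print(translate_cds(prefix))  # prints: MARTTRPLTN
```

The output matches the first ten residues of the listed protein sequence. Passing the full gene sequence to `translate_cds` and `gc_fraction` in the same way would allow the protein sequence and GC content fields to be compared against the record.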