Gene Information |
Locus tag | EcolC_2254 |
Symbol | |
ID | 6066951 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | - |
Start bp | 2480835 |
End bp | 2481986 |
Gene Length | 1152 bp |
Protein Length | 383 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 641601658 |
Product | integrase catalytic region |
Protein accession | YP_001725217 |
Protein GI | 170020263 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG2826] Transposase and inactivated derivatives, IS30 family |
TIGRFAM ID | |
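The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch; the variable names are illustrative and not part of the record, and it assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length, neither of which the record states explicitly.

```python
# Values copied from the Gene Information fields above.
start_bp = 2480835
end_bp = 2481986
gene_length_bp = 1152
protein_length_aa = 383

# Assumption: Start bp and End bp are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is excluded
# from the reported protein length.
codon_count = span_bp // 3
expected_protein_aa = codon_count - 1

print(span_bp == gene_length_bp)                 # span matches Gene Length
print(span_bp % 3 == 0)                          # span is a whole number of codons
print(expected_protein_aa == protein_length_aa)  # matches Protein Length
```

Under these assumptions all three checks agree with the record: a 1152 bp span is 384 codons, or 383 residues plus one stop codon.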
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.173899 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGACGAA CATTTACAGC AGAGGAAAAA GCCTCTGTTT TTGAACTATG GAAGAACGGA ACAGGCTTCA GTGAAATAGC GAATATCCTG GGTTCAAAAC CCGGAACGAT CTTCACTATG TTAAGGGATA CTGGCGGCAT AAAACCCCAT GAGCGTAAGC GGGCTGTAGC TCACCTGACA CTGTCTGAGC GCGAGGAGAT ACGAGCTGGT TTGTCAGCCA AAATGAGCAT TCGTGCGATA GCTACTGCGC TGAATCGCAG TCCTTCGACG ATCTCACGTG AAGTTCAGCG TAATCGGGGC AGACGCTATT ACAAAGCTGT TGATGCTAAT AACCGAGCCA ACAGAATGGC GAAAAGGCCA AAACCGTGCT TACTGGATCA AAATTTACCA TTGCGAAAGC TTGTTCTGGA AAAGCTGGAG ATGAAATGGT CTCCAGAGCA AATATCAGGA TGGTTAAGGC GAACAAAACC ACGTCAAAAA ACGCTGCGAA TATCACCTGA GACAATTTAT AAAACGCTGT ACTTTCGTAG CCGTGAAGCG CTACACCACC TGAATATACA GCATCTGCGA CGGTCGCATA GCCTTCGCCA TGGCAGGCGT CATACCCGCA AAGGCGAAAG AGGTACGATT AACATAGTGA ACGGAACACC AATTCACGAA CGTTCCCGAA ATATCGATAA CAGACGCTCT CTGGGGCATT GGGAGGGCGA TTTAGTCTCA GGTACAAAAA ACTCTCATAT AGCCACACTT GTAGACCGAA AATCACGTTA TACGATCATC CTTAGACTCA GGGGCAAAGA TTCTGTCTCA GTAAATCAGG CTCTTACCGA CAAATTCCTG AGTTTACCGT CAGAACTCAG AAAATCACTG ACATGGGACA GAGGAATGGA ACTGGCCAGA CATCTAGAAT TTACTGTCAG CACCGGCGTT AAAGTTTACT TCTGCGATCC TCAGAGTCCT TGGCAGCGGG GAACAAATGA GAACACAAAT GGGCTAATTC GGCAGTACTT TCCTAAAAAG ACATGTCTTG CCCAATATAC TCAACATGAA CTGGATCTGG TTGCTGCTCA GCTAAACAAC AGACCGAGAA AGACACTGAA GTTCAAAACA CCGAAAGAGA TAATTGAAAG GGGTGTTGCA TTGACAGATT GA
|
Protein sequence | MRRTFTAEEK ASVFELWKNG TGFSEIANIL GSKPGTIFTM LRDTGGIKPH ERKRAVAHLT LSEREEIRAG LSAKMSIRAI ATALNRSPST ISREVQRNRG RRYYKAVDAN NRANRMAKRP KPCLLDQNLP LRKLVLEKLE MKWSPEQISG WLRRTKPRQK TLRISPETIY KTLYFRSREA LHHLNIQHLR RSHSLRHGRR HTRKGERGTI NIVNGTPIHE RSRNIDNRRS LGHWEGDLVS GTKNSHIATL VDRKSRYTII LRLRGKDSVS VNQALTDKFL SLPSELRKSL TWDRGMELAR HLEFTVSTGV KVYFCDPQSP WQRGTNENTN GLIRQYFPKK TCLAQYTQHE LDLVAAQLNN RPRKTLKFKT PKEIIERGVA LTD
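The relationship between the two sequences above can be illustrated with a short sketch. It is not the database's own pipeline: the function names are hypothetical, and it assumes that translation table 11 assigns internal codons the same amino acids as the standard code (table 11 differs in which start codons it permits). Although the gene lies on the minus strand, the gene sequence as listed starts with ATG, and its first ten codons translate to the first ten residues of the listed protein, so no reverse complement is applied here.

```python
# Codon-to-amino-acid mapping in NCBI order (bases T, C, A, G).
# Assumption: table 11 uses this mapping for all non-start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def clean(seq: str) -> str:
    """Remove the spaces between 10-letter blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases of the gene sequence listed above.
prefix = "ATGAGACGAA CATTTACAGC AGAGGAAAAA"
print(translate(prefix))  # MRRTFTAEEK
```

Passing the full gene sequence to `translate` and `gc_fraction` gives values that can be compared with the Protein sequence and GC content fields.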