Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcE24377A_E0068 |
Symbol | |
ID | 5585860 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli E24377A |
Kingdom | Bacteria |
Replicon accession | NC_009790 |
Strand | - |
Start bp | 65257 |
End bp | 66795 |
Gene Length | 1539 bp |
Protein Length | 512 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 640913953 |
Product | IS66 family transposase |
Protein accession | YP_001451603 |
Protein GI | 157149542 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3436] Transposase and inactivated derivatives |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACGACA TCTCTTCTGA CGACATCTTC CTGCTGAAAC AGCGCCTGGC CGAACAGGAA GCGCTGATCC ACGCCCTGCA GGAAAAGCTG AGCAACTGGG AGCGCGAAAT AGACCATCTG CAGGCGCAGC TGGATAAACT CCGCCGGATG AACTTCGGCA GTCGTTCCGA AAAAGTCTCC CGCCGTATCG CACAAATGGA AGCCGATCTG AACCGGCTTC AGAAAGAGAG CGATACGCTG ACTGGTAGGG TGTATGACCC GGCTGTACAG CGTCCGTTGC GTCAGACCCG CACCCGTAAG CCGTTCCCTG AATCACTACC CCGTGACGAA AAGCGACTGT TGCCTGCGGC GCCGTGCTGC CCGAACTGCG GCGGTTCACT GAGCTATCTG GGCGAGGATA CCGCCGAACA GCTGGAGTTG ATGCGTAGTG CCTTCCGGGT TATCCGGACG GTACGGGAAA AACATGCCTG TACTCAGTGC GATGCCATCG TGCAGGCACC TGCACCTTCG CGGCCCATCG AGCGGGGTAT CGCCGGACCG GGGCTGCTGG CCCGCGTGCT GACCTCGAAG TATGCAGAGC ACACCCCGCT GTATCGCCAG TCAGAAATAT ACGGCCGGCA AGGTGTGGAG CTGAGCCGTT CACTGCTGTC GGGCTGGGTG GATGCATGCT GCCGGCTGCT GTCTCCGCTG GAAGAGGCGC TTCATGGCTA TGTCATGACT GACGGCAAAC TCCATGCCGA TGATACCCCG GTCCAGGTAC TGCTGCCGGG TAATAAGAAG ACGAAGACCG GGCGGTTGTG GGCGTATGTT CGTGATGACC GCAATGCCGG GTCAGCGTTG GCACCTGCAG TGTGGTTCGC TTACAGCCCG GACAGAAAAG GCATCCATCC GCAGACTCAT CTTGCCTGCT TCAGCGGTGT GCTGCAAGCG GATGCGTACG CCGGGTTCAA CGAGCTGTAT CGCAATGGTG GGATAACGGA AGCTGCCTGC TGGGCTCATG CCCGCCGAAA GATCCACGAT GTGCACGTCC GCATCCCGTC AGCACTGACG GAAGAAGCCC TGGAGCAGAT CGGTCAGTTG TACGCCATAG AGGCGGATAT AAGGGGAATG CCGGCAGAGC AGCGGCTTGC TGAACGTCAG CGAAAAACGA AACCGCTGTT GAAATCCCTG GAAAGCTGGT TGCGTGAAAA GATGAAAACC CTGTCGCGAC ACTCAGAACT GGCGAAAGCG TTCGCATACG CCCTGAACCA GTGGCCGGCG CTGACGTACT ATGCAGATGA TGGCTGGGCT GAGGCGGACA ATAACATCGC TGAAAATGCG TTGCGGATGG TCAGTCTGGG CCGCAAAAAC TACCTGTTCT TCGGTTCGGA TCATGGAGGA GAGCGGGGAG CGCTGCTGTA CAGCCTGATC GGGACGTGCA AACTGAACGG AGTGGAGCCA GAAAGCTACC TCCGCTATGT CCTTGACGTC ATAGCCGACT GGCCGATAAA CCGGGTCGGC GAACTGCTCC CCTGGCGCGT AGCACTGCCG ACTGAATAA
|
Protein sequence | MNDISSDDIF LLKQRLAEQE ALIHALQEKL SNWEREIDHL QAQLDKLRRM NFGSRSEKVS RRIAQMEADL NRLQKESDTL TGRVYDPAVQ RPLRQTRTRK PFPESLPRDE KRLLPAAPCC PNCGGSLSYL GEDTAEQLEL MRSAFRVIRT VREKHACTQC DAIVQAPAPS RPIERGIAGP GLLARVLTSK YAEHTPLYRQ SEIYGRQGVE LSRSLLSGWV DACCRLLSPL EEALHGYVMT DGKLHADDTP VQVLLPGNKK TKTGRLWAYV RDDRNAGSAL APAVWFAYSP DRKGIHPQTH LACFSGVLQA DAYAGFNELY RNGGITEAAC WAHARRKIHD VHVRIPSALT EEALEQIGQL YAIEADIRGM PAEQRLAERQ RKTKPLLKSL ESWLREKMKT LSRHSELAKA FAYALNQWPA LTYYADDGWA EADNNIAENA LRMVSLGRKN YLFFGSDHGG ERGALLYSLI GTCKLNGVEP ESYLRYVLDV IADWPINRVG ELLPWRVALP TE
|
| |