Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcE24377A_2023 |
Symbol | |
ID | 5587940 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli E24377A |
Kingdom | Bacteria |
Replicon accession | NC_009801 |
Strand | + |
Start bp | 2006338 |
End bp | 2007285 |
Gene Length | 948 bp |
Protein Length | 315 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 640925694 |
Product | IS605 family transposase |
Protein accession | YP_001463097 |
Protein GI | 157158023 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1943] Transposase and inactivated derivatives |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0000000292395 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAAAAAAG AAACCGATAT CCGGCGTGGA AGGCATTGTG TTTTCCTGAT GCATGTTCAC CTGGTCTTTG TCACCAGATA CCGACTCCAG ATTTTTGACC ATGACGCGAC AGAAAAACTA CGCACTTACT TTTCAAATGT ATGTGCTGAT TTTGAAGCTG AACTTGTTGA AATGGATGGC GAACCAGATC ACGTCCATTT GTTAATTAAC TATCCTCCCA AACTGGCGAT ATCCAGTCTG GTAAACAGCC TCAAAGGCGT ATCGGGTAGG TTACTGCGAC GAGATCGACC AGATATTGCA GTCAGGTATT ACTACAAAGG CGTTTTGTGG AGTCCTGGCT ATTTTGCCAG TAGCTGCGGA GGTGCGCCAA TATCCGTCAT CCGCCAATAC ATTGAACAAC AGCAAACACC TGGTCAGGTG GAAAACCGCG CCTTATATCC CCGCCCTGAA GGACGGGGTT TTACGGCGCA CCGGATAAGA ACGCACGGTA TTCACCAGAT CTTTTATCAC TTCAGCCGCC ACTTCTGGCA CCAGCAAAGT CATCGGCGTC TCTGTTTCAT AATCGACAGA AACGCCATTG CTGTTATTGG TGACGGTCAC GGTATACGTT GCTTTGCCCA TGATTCATTT CCCGTTATGA ATGACTTTCC GTTGTTGCGC ACCTTCCATC AGGACTTCAG GAGCCACGAA GAAGTCAATG TTGAAATAAG TATCGTCAGT CATGGCTTCA ATGTTGTGCC ACTTTTCTGG AGGGAACACC GCAAACTGCC CCGCTTCGAT AAGGATCACC TGATCAGGCT CTGCACTGTG TTCATCAGCG TAGCCGAGAT ATTTGACCGC CCCATGCATA ACGGAAAGGC GTGGGTAAAC CCCCGGGCGC GTTCCTTTAT CAAGATGACG TTCGAATATT CCGGCAGGCG CAGTTTGTTT ATTCCAGAAA GGCGTTGA
|
Protein sequence | MKKETDIRRG RHCVFLMHVH LVFVTRYRLQ IFDHDATEKL RTYFSNVCAD FEAELVEMDG EPDHVHLLIN YPPKLAISSL VNSLKGVSGR LLRRDRPDIA VRYYYKGVLW SPGYFASSCG GAPISVIRQY IEQQQTPGQV ENRALYPRPE GRGFTAHRIR THGIHQIFYH FSRHFWHQQS HRRLCFIIDR NAIAVIGDGH GIRCFAHDSF PVMNDFPLLR TFHQDFRSHE EVNVEISIVS HGFNVVPLFW REHRKLPRFD KDHLIRLCTV FISVAEIFDR PMHNGKAWVN PRARSFIKMT FEYSGRRSLF IPERR
|
| |