Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcE24377A_2022 |
Symbol | |
ID | 5588636 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli E24377A |
Kingdom | Bacteria |
Replicon accession | NC_009801 |
Strand | - |
Start bp | 2005107 |
End bp | 2006315 |
Gene Length | 1209 bp |
Protein Length | 402 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 640925693 |
Product | IS605 family transposase OrfB |
Protein accession | YP_001463096 |
Protein GI | 157156692 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0675] Transposase and inactivated derivatives |
TIGRFAM ID | [TIGR01766] transposase, IS605 OrfB family, central region |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.00000000000253507 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAAACGAT TACAGGCATT TAAATTCCAG TTAAGACCAG GTGGTCAACA GGAGTGTGAA ATGAGGCGCT TCGCAGGCGC ATGTCGTTTC GTTTTCAATC GTGCTCTGGC ACGTCAGAAT GAGAATCATG AGGCCGGTAA TAAATACATC CCTTACGGGA AAATGGCTTC CTGGCTGGTT GAGTGGAAAA ACGCCACTGA AACGCAATGG CTTAAAGATT CGCCCTCACA GCCATTGCAA CAGTCACTGA AAGACCTTGA GCGGGCTTAC AAAAACTTCT TCCGGAAGCG GGCTGCTTTT CCCCGATTCA AAAAGCGGGG ACAGAATGAT GCATTCCGCT ACCCGCAGGG CGTAAAGCTC GATCAGGAAA ACAGCCGTAT TTTTCTGCCG AAACTGGGCT GGATGCGCTA CCGGAATAGC CGTCAGGTCA CGGGTGTTGT GAAAAATGTC ACGGTCAGCC AGTCCTGCGG TAAGTGGTAC ATCAGTATTC AGACAGAAAG TGAAGTATCA ACTCCGGTTC ACCCTTCAGC ATCAATGATC GGGCTGGATG CTGGCGTGGC TAAACTCGCC ACGCTGTCAG ATGGCACAGT CTTTGGGCCT GTAAACAGTT TTCAGAAAAA CCAGAAGACG CTGGCGAGAC TTCAGCGACA GTTAAGCCGC AAGGTCAAAT TCAGCAATAA CTGGCAGAAG CAGAAACGCA AAATACAGCG ACTGCATTCC CGTATCGCAA ATATCCGCAG GGACTACCTT CATAAAGTCA CAACGACCGT CAGCAAAAAC CACGCAATGA TAGTCATTGA GGATTTGAAG GTCAGCAACA TGTCAAAGTC AGCAGCGGGT ACGGTCAGCC AGCCGGGGCG CAATGTCCGG GCAAAATCAG GTTTAAACCG TTCGATACTG GATCAGGGCT GGTATGAAAT GCGCCGCCAG CTTGAGTACA AGCAGCTCTG GAGTGGCGGT CAGGTGCTTG CTGTTCCGCC AGCGTACACA AGCCAGCGTT GCGCGTGCTG TGGTCATACA GCGAAAGAAA ATCGCCTGTC ACAAAGTCAA TTCAGATGCC AGGTATGTGG ATATACAGCG AACGCCGATG TAAACGGCGC TCGTAACATT TTAGCGGCGG GGCACGCCGT TCTTGCCTGT GGAGAGATGG TGCAGTCAGG CCGCTCGTTG AAGCAGGAAC CCACCGAAAT GATTCAGGCG ACAGCCTGA
|
Protein sequence | MKRLQAFKFQ LRPGGQQECE MRRFAGACRF VFNRALARQN ENHEAGNKYI PYGKMASWLV EWKNATETQW LKDSPSQPLQ QSLKDLERAY KNFFRKRAAF PRFKKRGQND AFRYPQGVKL DQENSRIFLP KLGWMRYRNS RQVTGVVKNV TVSQSCGKWY ISIQTESEVS TPVHPSASMI GLDAGVAKLA TLSDGTVFGP VNSFQKNQKT LARLQRQLSR KVKFSNNWQK QKRKIQRLHS RIANIRRDYL HKVTTTVSKN HAMIVIEDLK VSNMSKSAAG TVSQPGRNVR AKSGLNRSIL DQGWYEMRRQ LEYKQLWSGG QVLAVPPAYT SQRCACCGHT AKENRLSQSQ FRCQVCGYTA NADVNGARNI LAAGHAVLAC GEMVQSGRSL KQEPTEMIQA TA
|
| |