Gene EcE24377A_2022 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcE24377A_2022 
Symbol 
ID5588636 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli E24377A 
KingdomBacteria 
Replicon accessionNC_009801 
Strand
Start bp2005107 
End bp2006315 
Gene Length1209 bp 
Protein Length402 aa 
Translation table11 
GC content51% 
IMG OID640925693 
ProductIS605 family transposase OrfB 
Protein accessionYP_001463096 
Protein GI157156692 
COG category[L] Replication, recombination and repair 
COG ID[COG0675] Transposase and inactivated derivatives 
TIGRFAM ID[TIGR01766] transposase, IS605 OrfB family, central region 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000000000253507 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAAACGAT TACAGGCATT TAAATTCCAG TTAAGACCAG GTGGTCAACA GGAGTGTGAA 
ATGAGGCGCT TCGCAGGCGC ATGTCGTTTC GTTTTCAATC GTGCTCTGGC ACGTCAGAAT
GAGAATCATG AGGCCGGTAA TAAATACATC CCTTACGGGA AAATGGCTTC CTGGCTGGTT
GAGTGGAAAA ACGCCACTGA AACGCAATGG CTTAAAGATT CGCCCTCACA GCCATTGCAA
CAGTCACTGA AAGACCTTGA GCGGGCTTAC AAAAACTTCT TCCGGAAGCG GGCTGCTTTT
CCCCGATTCA AAAAGCGGGG ACAGAATGAT GCATTCCGCT ACCCGCAGGG CGTAAAGCTC
GATCAGGAAA ACAGCCGTAT TTTTCTGCCG AAACTGGGCT GGATGCGCTA CCGGAATAGC
CGTCAGGTCA CGGGTGTTGT GAAAAATGTC ACGGTCAGCC AGTCCTGCGG TAAGTGGTAC
ATCAGTATTC AGACAGAAAG TGAAGTATCA ACTCCGGTTC ACCCTTCAGC ATCAATGATC
GGGCTGGATG CTGGCGTGGC TAAACTCGCC ACGCTGTCAG ATGGCACAGT CTTTGGGCCT
GTAAACAGTT TTCAGAAAAA CCAGAAGACG CTGGCGAGAC TTCAGCGACA GTTAAGCCGC
AAGGTCAAAT TCAGCAATAA CTGGCAGAAG CAGAAACGCA AAATACAGCG ACTGCATTCC
CGTATCGCAA ATATCCGCAG GGACTACCTT CATAAAGTCA CAACGACCGT CAGCAAAAAC
CACGCAATGA TAGTCATTGA GGATTTGAAG GTCAGCAACA TGTCAAAGTC AGCAGCGGGT
ACGGTCAGCC AGCCGGGGCG CAATGTCCGG GCAAAATCAG GTTTAAACCG TTCGATACTG
GATCAGGGCT GGTATGAAAT GCGCCGCCAG CTTGAGTACA AGCAGCTCTG GAGTGGCGGT
CAGGTGCTTG CTGTTCCGCC AGCGTACACA AGCCAGCGTT GCGCGTGCTG TGGTCATACA
GCGAAAGAAA ATCGCCTGTC ACAAAGTCAA TTCAGATGCC AGGTATGTGG ATATACAGCG
AACGCCGATG TAAACGGCGC TCGTAACATT TTAGCGGCGG GGCACGCCGT TCTTGCCTGT
GGAGAGATGG TGCAGTCAGG CCGCTCGTTG AAGCAGGAAC CCACCGAAAT GATTCAGGCG
ACAGCCTGA
 
Protein sequence
MKRLQAFKFQ LRPGGQQECE MRRFAGACRF VFNRALARQN ENHEAGNKYI PYGKMASWLV 
EWKNATETQW LKDSPSQPLQ QSLKDLERAY KNFFRKRAAF PRFKKRGQND AFRYPQGVKL
DQENSRIFLP KLGWMRYRNS RQVTGVVKNV TVSQSCGKWY ISIQTESEVS TPVHPSASMI
GLDAGVAKLA TLSDGTVFGP VNSFQKNQKT LARLQRQLSR KVKFSNNWQK QKRKIQRLHS
RIANIRRDYL HKVTTTVSKN HAMIVIEDLK VSNMSKSAAG TVSQPGRNVR AKSGLNRSIL
DQGWYEMRRQ LEYKQLWSGG QVLAVPPAYT SQRCACCGHT AKENRLSQSQ FRCQVCGYTA
NADVNGARNI LAAGHAVLAC GEMVQSGRSL KQEPTEMIQA TA