Gene EcE24377A_D0019 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcE24377A_D0019 
Symbol 
ID5585763 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli E24377A 
KingdomBacteria 
Replicon accessionNC_009788 
Strand
Start bp16360 
End bp17898 
Gene Length1539 bp 
Protein Length512 aa 
Translation table11 
GC content59% 
IMG OID640913839 
ProductIS66 family transposase 
Protein accessionYP_001451489 
Protein GI157149433 
COG category[L] Replication, recombination and repair 
COG ID[COG3436] Transposase and inactivated derivatives 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACGACA TCTCTTCTGA CGACATCTTC CTGCTGAAAC AGCGCCTGGC CGAACAGGAA 
GCGCTGATCC ACGCCCTGCA GGAAAAGCTG AGCAACTGGG AGCGCGAAAT AGACCATCTG
CAGGCGCAGC TGGATAAACT CCGCCGGATG AACTTCGGCA GTCGTTCCGA AAAAGTCTCC
CGCCGTATCG CACAAATGGA AGCCGATCTG AACCGGCTTC AGAAAGAGAG CGATACGCTG
ACTGGTAGGG TGTATGACCC GGCTGTACAG CGTCCGTTGC GTCAGACCCG CACCCGTAAG
CCGTTCCCTG AATCACTACC CCGTGACGAA AAGCGACTGT TGCCTGCGGC GCCGTGCTGC
CCGAACTGCG GCGGTTCACT GAGCTATCTG GGCGAGGATA CCGCCGAACA GCTGGAGTTG
ATGCGTAGTG CCTTCCGGGT TATCCGGACG GTACGGGAAA AACATGCCTG TACTCAGTGC
GATGCCATCG TGCAGGCACC TGCACCTTCG CGGCCCATCG AGCGGGGTAT CGCCGGACCG
GGGCTGCTGG CCCGCGTGCT GACCTCGAAG TATGCAGAGC ACACCCCGCT GTATCGCCAG
TCAGAAATAT ACGGCCGGCA AGGTGTGGAG CTGAGCCGTT CACTGCTGTC GGGCTGGGTG
GATGCATGCT GCCGGCTGCT GTCTCCGCTG GAAGAGGCGC TTCATGGCTA TGTCATGACT
GACGGCAAAC TCCATGCCGA TGATACCCCG GTCCAGGTAC TGCTGCCGGG TAATAAGAAG
ACGAAGACCG GGCGGTTGTG GGCGTATGTT CGTGATGACC GCAATGCCGG GTCAGCGTTG
GCACCTGCAG TGTGGTTCGC TTACAGCCCG GACAGAAAAG GCATCCATCC GCAGACTCAT
CTTGCCTGCT TCAGCGGTGT GCTGCAAGCG GATGCGTACG CCGGGTTCAA CGAGCTGTAT
CGCAATGGTG GGATAACGGA AGCTGCCTGC TGGGCTCATG CCCGCCGAAA GATCCACGAT
GTGCACGTCC GCATCCCGTC AGCACTGACG GAAGAAGCCC TGGAGCAGAT CGGTCAGTTG
TACGCCATAG AGGCGGATAT AAGGGGAATG CCGGCAGAGC AGCGGCTTGC TGAACGTCAG
CGAAAAACGA AACCGCTGTT GAAATCCCTG GAAAGCTGGT TGCGTGAAAA GATGAAAACC
CTGTCGCGAC ACTCAGAACT GGCGAAAGCG TTCGCATACG CCCTGAACCA GTGGCCGGCG
CTGACGTACT ATGCAGATGA TGGCTGGGCT GAGGCGGACA ATAACATCGC TGAAAATGCG
TTGCGGATGG TCAGTCTGGG CCGCAAAAAC TACCTGTTCT TCGGTTCGGA TCATGGAGGA
GAGCGGGGAG CGCTGCTGTA CAGCCTGATC GGGACGTGCA AACTGAACGG AGTGGAGCCA
GAAAGCTACC TCCGCTATGT CCTTGACGTC ATAGCCGACT GGCCGATAAA CCGGGTCGGC
GAACTGCTCC CCTGGCGCGT AGCACTGCCG ACTGAATAA
 
Protein sequence
MNDISSDDIF LLKQRLAEQE ALIHALQEKL SNWEREIDHL QAQLDKLRRM NFGSRSEKVS 
RRIAQMEADL NRLQKESDTL TGRVYDPAVQ RPLRQTRTRK PFPESLPRDE KRLLPAAPCC
PNCGGSLSYL GEDTAEQLEL MRSAFRVIRT VREKHACTQC DAIVQAPAPS RPIERGIAGP
GLLARVLTSK YAEHTPLYRQ SEIYGRQGVE LSRSLLSGWV DACCRLLSPL EEALHGYVMT
DGKLHADDTP VQVLLPGNKK TKTGRLWAYV RDDRNAGSAL APAVWFAYSP DRKGIHPQTH
LACFSGVLQA DAYAGFNELY RNGGITEAAC WAHARRKIHD VHVRIPSALT EEALEQIGQL
YAIEADIRGM PAEQRLAERQ RKTKPLLKSL ESWLREKMKT LSRHSELAKA FAYALNQWPA
LTYYADDGWA EADNNIAENA LRMVSLGRKN YLFFGSDHGG ERGALLYSLI GTCKLNGVEP
ESYLRYVLDV IADWPINRVG ELLPWRVALP TE