Gene EcolC_0053 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_0053 
Symbol 
ID6068469 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp54119 
End bp55294 
Gene Length1176 bp 
Protein Length391 aa 
Translation table11 
GC content41% 
IMG OID641599456 
Productintegrase family protein 
Protein accessionYP_001723066 
Protein GI170018112 
COG category[L] Replication, recombination and repair 
COG ID[COG0582] Integrase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.613927 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTCTAA CCGATACACA GATCAAACGT GCAAAACCAC AAGACAAGCC ATACACATTG 
AACGATGGAC AAGGTCTGTC GTTGCTTATC AATCCAGATG GCACGAAAGG CTGGCGTTTC
CGTTTCAGAT TTGCTGGGAA AGCGCGGTTA ATGTCATTTG GCAGCTATGA TTTAGTAAGC
CTCGCAGAAG CACGTGAGAA GCGTGACATC GCCCGTAAGC AGGTTGCTAA TGGCATTGAC
CCAGTAGAGG AACGCAAAGC TTTAAGACTC GCTCAAAAGC TATCAACAGA AAATTCTTTC
GAAGCAATAT GTCGAGAATG GCATACCAAC AAAGCTGACC GCTGGACGGT GGCCTATCGA
GAAGAAATTA TGAAGACTTT TGAGCAAGAT GTATTCCCTT TCATTGGTAA ACGCCCTATC
AGTGAAATTA AACCATTAGA ACTGCTCGAA GTATTGCGAA GAATAGAAAA GCGTGGGGCA
TTAGAGAAGA CCAGAAAAGT GCGGCAAAGA TGTGGTGAAG TCTACCGCTA TGCGATCATA
ACTGGCCGTG CTGAATACAA TCCTGCGCCT GATTTAGCCA TCGCTCTGGC TGTTCCTAAG
CAAAAACATC ATCCTTTTTT ATCCGCTGAA GAGCTACCTC ATTTCATTCA GGATTTGGAA
GCGTATACCG GAAGTATCAT TACTAAAAAT GCTACTAAGA TAGTTATGCT GACCGGCGTT
AGAACGCAGG AAATGCGTTT GGCTACTTGG AATGAGGTTG ATCTTGAGAA AGGCATATGG
GAAATACCTG CAGAAAGGAT GAAAATGCGT AGGCCACACA TTGTTCCTTT ATCTACTCAG
GTAATTGCCC TTTTCGAACA ACTCAAGCCT ATTACCGGCC ATTACCCCTA CATATTTATT
GGAAGGAACA ATCGTAGCAA ACCAATTTCA AAAGAAAGCG TATCTCAAGT AATTGAGTTA
CTTGGTTACA AAGGACGTGC TACAGGTCAC GGTTTTAGAC ATTCATTATC GACAATCTTA
CATGAACATG GATTTGATAG TGCATGGATT GAGATGCAAT TAGCACATGT TGATAAAAAC
AGTATAAGAG GTACTTATAA TCATGCTCAA TATTTAGAGA AAAGATTACA TATGATGCAG
TGGTATAGTG ACTTACTTTA TCCAAAAATA AAATAA
 
Protein sequence
MALTDTQIKR AKPQDKPYTL NDGQGLSLLI NPDGTKGWRF RFRFAGKARL MSFGSYDLVS 
LAEAREKRDI ARKQVANGID PVEERKALRL AQKLSTENSF EAICREWHTN KADRWTVAYR
EEIMKTFEQD VFPFIGKRPI SEIKPLELLE VLRRIEKRGA LEKTRKVRQR CGEVYRYAII
TGRAEYNPAP DLAIALAVPK QKHHPFLSAE ELPHFIQDLE AYTGSIITKN ATKIVMLTGV
RTQEMRLATW NEVDLEKGIW EIPAERMKMR RPHIVPLSTQ VIALFEQLKP ITGHYPYIFI
GRNNRSKPIS KESVSQVIEL LGYKGRATGH GFRHSLSTIL HEHGFDSAWI EMQLAHVDKN
SIRGTYNHAQ YLEKRLHMMQ WYSDLLYPKI K