Gene EcolC_3892 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_3892 
Symbol 
ID6064612 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp4269935 
End bp4270915 
Gene Length981 bp 
Protein Length326 aa 
Translation table11 
GC content56% 
IMG OID641603306 
Producttransposase IS4 family protein 
Protein accessionYP_001726821 
Protein GI170021867 
COG category[L] Replication, recombination and repair 
COG ID[COG3039] Transposase and inactivated derivatives, IS5 family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCATC AACTCACCTT CGCCGATAGT GAATTCAGCA CTAAGCGCCG TCAGACCCGA 
AAAGAGATTT TCCTCTCCCG CATGGAGCAG ATTCTGCCAT GGCAGAATAT GACCGCTGTC
ATCGAGCCGT TTTATCCCAA GGCGGGCAAT GGCCGACGGC CCTATCCGCT GGAGACCATG
CTGCGTATTC ACTGCATGCA GCATTGGTAC AACCTGAGCG ACGGTGCCAT GGAAGATGCC
CTGTACGAAA TCGCCTCCAT GCGCCTGTTT GCCCGATTAT CCCTGGATAG CGCCCTGCCG
GATCGCACCA CCATCATGAA TTTCCGCCAC CTGCTCGAGC AGCATCAACT GGCCCGTCAA
TTGTTCAAGA CCATCAATCG CTGGCTGGCC GAAGCAGGCG TCATGATGAC CCAAGGCACT
TTGGTGGATG CCACCATCAT TGAGGCACCC AGCTCTACCA AGAACAAAGA GCAGCAACGC
GATCCGGAGA TGCATCAGAC CAAGAAAGGC AATCAGTGGC ACTTTGGCAT GAAGGCCCAC
ATTGGTGTCG ATGCCAAGAG TGGCCTGACC CACAGCCTGG TCACCACCGC GGCCAACGAG
CATGACCTCA ATCAGCTGGG TAATCTGCTT CATGGAGAGG AGCAATTTGT CTCAGCCGAT
GCCGGCTACC AAGGAGCGCC ACAGCGCGAG GAGCTGGCCG AGGTGGATGT GGACTGGCTG
ATCGCCGAGC GTCCCGGCAA GGTAAAAACC TTGAAGCAGC ATCCGCGCAA GAACAAAACG
GCCATCAACA TCGAATACAT GAAAGCCAGC ATCCGTGCCA GGGTGGAGCA CCCGTTTCGC
ATCATCAAGC GGCAGTTCGG CTTCGTGAAA GCCAGATACA AGCGGCTGCT GAAAAACGAT
AACCAACTGG CGATGTTATT CACCCTGGCC AACCTGTTTC GGGTGGACCA AATGATACGT
CAGTGGGAGA GATCTCACTA A
 
Protein sequence
MSHQLTFADS EFSTKRRQTR KEIFLSRMEQ ILPWQNMTAV IEPFYPKAGN GRRPYPLETM 
LRIHCMQHWY NLSDGAMEDA LYEIASMRLF ARLSLDSALP DRTTIMNFRH LLEQHQLARQ
LFKTINRWLA EAGVMMTQGT LVDATIIEAP SSTKNKEQQR DPEMHQTKKG NQWHFGMKAH
IGVDAKSGLT HSLVTTAANE HDLNQLGNLL HGEEQFVSAD AGYQGAPQRE ELAEVDVDWL
IAERPGKVKT LKQHPRKNKT AINIEYMKAS IRARVEHPFR IIKRQFGFVK ARYKRLLKND
NQLAMLFTLA NLFRVDQMIR QWERSH