Gene ECH74115_3643 details

Gene Information

Locus tag: ECH74115_3643
Symbol: zipA
ID: 6970557
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 3358743
End bp: 3359741
Gene Length: 999 bp
Protein Length: 332 aa
Translation table: 11
GC content: 55%
IMG OID: 643387438
Product: cell division protein ZipA
Protein accession: YP_002271891
Protein GI: 209400232
COG category: [D] Cell cycle control, cell division, chromosome partitioning
COG ID: [COG3115] Cell division protein
TIGRFAM ID: [TIGR02205] cell division protein ZipA
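
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state the convention) and that the protein length excludes the stop codon. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 3358743
end_bp = 3359741

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of three bases; the final codon is the stop
# codon and contributes no amino acid (assumption stated in the lead-in).
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, "bp")
print(protein_length, "aa")
```

Under these assumptions the computed values agree with the listed Gene Length (999 bp) and Protein Length (332 aa).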


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000000110035
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 65
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATGCAGG ATTTGCGTCT GATATTAATC ATTGTTGGCG CGATCGCCAT AATCGCTTTA 
CTGGTACATG GTTTCTGGAC CAGCCGTAAA GAACGATCTT CTATGTTCCG CGATCGGCCA
TTAAAACGAA TGAAGTCAAA ACGTGACGAC GATTCTTATG ACGAGGATGT CGAAGATGAT
GAGGGCGTTG GTGAGGTTCG TGTTCACCGC GTGAATCATG CCCCGGCTAA CGCCCAGGAG
CATGAGGCTG CTCGTCCGTC GCCGCAACAC CAGTACCAAC CGCCTTATGC GTCTGCGCAG
CCGCGTCAAC CGGTCCAGCA GCCGCCTGAA GCGCAGGTAC CGCCGCAACA TGCTCCGCGT
CCAGCGCAGC CGGTGCAGCA ACCCGTGCAG CAGCCTGCCT ATCAGCCGCA GCCTGAACAG
CCGTTGCAGC AGCCAGTTTC GCCACAGGTC GCGCCAGCGC CGCAGCCAGT GCATTCAGCA
CCGCAACCGG CACAACAGGC TTTCCAGCCT GCAGAACCCG TAGCGGCACC ACAGCCTGAG
CCTGTAGCGG AACCGGCTCC AGTTATGGAT AAACCGAAGC GCAAAGAAGC GGTGATTATC
ATGAACGTCG CGGCGCATCA CGGTAGCGAG CTAAACGGTG AACTGCTTCT TAACAGCATT
CAACAAGCGG GCTTCATTTT TGGCGATATG AATATTTACC ATCGTCATCT TAGCCCGGAT
GGCAGCGGCC CGGCGTTATT TAGCCTGGCG AATATGGTGA AACCGGGAAC CTTTGATCCT
GAAATGAAGG ATTTCACTAC TCCGGGTGTC ACTATCTTTA TGCAGGTACC GTCTTACGGT
GACGAGCTGC AGAACTTCAA GCTGATGCTG CAATCTGCGC AGCATATTGC CGATGAAGTG
GGCGGTGTCG TGCTTGACGA TCAGCGCCGT ATGATGACTC CGCAGAAATT GCGCGAGTAC
CAGGACATCA TCCGCGAAGT TAAAGACGCC AACGCCTGA
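
The gene sequence is printed in blocks of ten bases separated by spaces, so it has to be joined before any analysis. The sketch below shows one way to do that and to compute a GC fraction of the kind summarized in the GC content field. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C (0.0 for empty input)."""
    if not seq:
        return 0.0
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence shown above, used as sample input.
first_line = "ATGATGCAGG ATTTGCGTCT GATATTAATC ATTGTTGGCG CGATCGCCAT AATCGCTTTA"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```

Passing the full sequence block to `clean_sequence` instead of the sample line gives the value to compare against the GC content listed in the record.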
 
Protein sequence
MMQDLRLILI IVGAIAIIAL LVHGFWTSRK ERSSMFRDRP LKRMKSKRDD DSYDEDVEDD 
EGVGEVRVHR VNHAPANAQE HEAARPSPQH QYQPPYASAQ PRQPVQQPPE AQVPPQHAPR
PAQPVQQPVQ QPAYQPQPEQ PLQQPVSPQV APAPQPVHSA PQPAQQAFQP AEPVAAPQPE
PVAEPAPVMD KPKRKEAVII MNVAAHHGSE LNGELLLNSI QQAGFIFGDM NIYHRHLSPD
GSGPALFSLA NMVKPGTFDP EMKDFTTPGV TIFMQVPSYG DELQNFKLML QSAQHIADEV
GGVVLDDQRR MMTPQKLREY QDIIREVKDA NA
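
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The record lists translation table 11, which assigns the same amino acids to codons as the standard genetic code and differs in the start codons it permits. The sketch below therefore builds the standard codon table; the `translate` function is a hypothetical helper, not part of any database tool, and it assumes the input is already in frame on the coding strand.

```python
from itertools import product

# Standard codon assignments in TCAG order (shared by translation table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop block spacing and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons and last five codons of the gene sequence above.
head = translate("ATGATGCAGG ATTTGCGTCT G")
tail = translate("GACGCC AACGCCTGA")
print(head, tail)
```

The translated fragments correspond to the beginning (MMQDLRL) and end (DANA) of the protein sequence listed above.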