Gene ECH74115_4549 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_4549 
Symbol 
ID6966767 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp4215873 
End bp4217000 
Gene Length1128 bp 
Protein Length375 aa 
Translation table11 
GC content52% 
IMG OID643388260 
ProductATPase, AFG1 family 
Protein accessionYP_002272695 
Protein GI209400662 
COG category[R] General function prediction only 
COG ID[COG1485] Predicted ATPase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000104108 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones76 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAAAGCG TTACTCCAAC ATCGCAATAC CTGAAGGCGC TCAATGAAGG CAGCCATCAA 
CCCGACGACG TTCAAAAAGA GGCCGTCAGC CGCTTGGAAA TTATTTATCA GGAACTCATC
AATAGCACGC CACCAGCCCC CAGGACGAGT GGGCTAATGG CGCGGGTCGG TAAGCTGTGG
GGTAAACGGG AAGACACAAA GCATACGCCA GTGCGTGGCT TATATATGTG GGGCGGTGTA
GGACGCGGGA AAACCTGGCT GATGGACCTT TTCTATCAAA GCCTGCCGGG AGAGCGGAAA
CAGCGCCTGC ACTTTCACCG TTTTATGCTG CGGGTGCATG AAGAGCTAAC TGCCTTACAG
GGGCAGACCG ATCCGCTGGA AATTATTGCT GATCGTTTTA AAGCAGAAAC TGACGTGCTC
TGTTTTGACG AATTTTTTGT CTCTGATATT ACCGATGCCA TGCTACTTGG CGGCCTGATG
AAAGCCCTGT TCGCCCGCGG TATTACCCTG GTAGCGACGT CAAATATTCC GCCGGACGAA
CTTTATCGAA ATGGCTTGCA ACGTGCGCGT TTTCTGCCTG CAATCGATGC CATTAAACAG
CATTGTGATG TAATGAACGT GGACGCTGGT GTTGATTATC GTCTGCGTAC ACTCACTCAG
GCGCATCTGT GGCTTTCGCC ACTCCACGAT GAAACCCGGG CGCAAATGGA TAAACTATGG
TTGGCGCTGG CGGGGGGGAA ACGAGAAAAT TCACCGACGT TAGAAATCAA CCATCGGCCA
TTAGCGACAA TGGGCGTCGA GAACCAGACG CTGGCGGTCT CTTTTACTAC GCTGTGCGTC
GACGCCCGCA GTCAGCATGA CTATATTGCG CTCTCACGTC TCTTTCATAC GGTCATGTTG
TTTGATGTAC CAGTTATGAC GCGGTTGATG GAGAGCGAAG CGCGGCGCTT TATTGCGCTG
GTGGATGAGT TTTACGAGCG CCATGTCAAA TTAGTGGTGA GTGCAGAAGT GCCGCTGTAT
GAAATTTATC AGGGCGATCG GCTGAAGTTT GAGTTCCAGC GTTGCCTGTC ACGTCTGCAA
GAGATGCAAA GCGAAGAGTA TCTGAAGCGC GAGCATTTAG CGGGTTAA
 
Protein sequence
MQSVTPTSQY LKALNEGSHQ PDDVQKEAVS RLEIIYQELI NSTPPAPRTS GLMARVGKLW 
GKREDTKHTP VRGLYMWGGV GRGKTWLMDL FYQSLPGERK QRLHFHRFML RVHEELTALQ
GQTDPLEIIA DRFKAETDVL CFDEFFVSDI TDAMLLGGLM KALFARGITL VATSNIPPDE
LYRNGLQRAR FLPAIDAIKQ HCDVMNVDAG VDYRLRTLTQ AHLWLSPLHD ETRAQMDKLW
LALAGGKREN SPTLEINHRP LATMGVENQT LAVSFTTLCV DARSQHDYIA LSRLFHTVML
FDVPVMTRLM ESEARRFIAL VDEFYERHVK LVVSAEVPLY EIYQGDRLKF EFQRCLSRLQ
EMQSEEYLKR EHLAG