Gene ECH74115_3390 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_3390 
Symbol 
ID6969115 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp3134306 
End bp3135508 
Gene Length1203 bp 
Protein Length400 aa 
Translation table11 
GC content54% 
IMG OID643387198 
Productcompetence damage-inducible protein A 
Protein accessionYP_002271661 
Protein GI209395745 
COG category[R] General function prediction only 
COG ID[COG1058] Predicted nucleotide-utilizing enzyme related to molybdopterin-biosynthesis enzyme MoeA 
TIGRFAM ID[TIGR00177] molybdenum cofactor synthesis domain
[TIGR00200] competence/damage-inducible protein CinA N-terminal domain 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones65 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTAAAAG TGGAAATGCT ATCCACCGGG GATGAAGTGT TACACGGGCA AATCGTTGAC 
ACTAACGCTG CCTGGCTGGC CGATTTTTTC TTTCATCAGG GGTTGCCATT ATCTCGCCGC
AATACGGTGG GGGATAACCT TGATGACTTA GTCACCATTC TTCGCGAACG TAGTCAGCAC
GCCGATGTGC TGATCGTTAA CGGCGGGCTG GGACCGACCA GCGATGATTT AAGCGCACTC
GCCGCTGCGA CAGCAAAAGG TGAAGGCCTG GTGCTGCATG AAGCCTGGCT CAAAGAGATG
GAACGCTATT TCCACGAACG TGGACGAGTA ATGGCACCGA GCAACCGTAA ACAAGCGGAG
CTGCCTGCCA GTGCTGAATT TATCAATAAC CCGGTAGGCA CCGCCTGTGG TTTTGCCGTG
CAGCTTAATC GTTGCCTGAT GTTCTTTACT CCCGGCGTAC CGTCAGAATT TAAGGTGATG
GTCGAGCACG AAATCCTGCC GCGTCTGCGC GAGCGTTTTT CTTTACCACA GCCGCCGGTT
TGTCTGCGTT TAACTACCTT TGGTCGTTCG GAAAGCGATC TGGCACAAAG CCTGGACACT
CTACAACTGC CGCCGGGCGT AACAATGGGC TATCGCTCCT CAATGCCGAT CATCGAACTG
AAACTCACCG GACCGGCAAG CGAGCAACAG GCGATGGAAA AACTGTGGCT GGACGTTAAA
CGTGTTGCCG GACAGAGCGT GATTTTCGAA GGCACCGAAG GACTGCCCGC GCAGATCAGT
CGCGAATTGC AAAACCGCCA GTTCAGCCTG ACGTTGAGCG AGCAATTCAC CGGTGGTTTA
TTGGCTTTGC AACTTTCTCG CGCAGGTGCT CCACTGCTGG CGTGTGAAGT GGTTCCTTCA
CAGGAAGAAA CCCTGGCGCA AACTGCGCAC TGGATTACAG AACGGCGAGC CAACCATTTT
GCCGGGCTGG CACTGGCTGT TTCGGGTTTC GAGAACGAGC ATCTCAACTT TGCGCTAGCC
ACGCCAGATG GCACTTTAGC TCTGCGTGTG CGTTTCAGCA CTACACGCTA TAGCCTGGCT
ATCCGTCAGG AAGTGTGCGC AATGATGGCA CTGAATATGC TACGCCGTTG GTTAAACGGC
CAGGACATTG CCAGTGAGCA TGGCTGGATT GAGGTTGTTG AGTCCATGAC CTTATCCGTC
TGA
 
Protein sequence
MLKVEMLSTG DEVLHGQIVD TNAAWLADFF FHQGLPLSRR NTVGDNLDDL VTILRERSQH 
ADVLIVNGGL GPTSDDLSAL AAATAKGEGL VLHEAWLKEM ERYFHERGRV MAPSNRKQAE
LPASAEFINN PVGTACGFAV QLNRCLMFFT PGVPSEFKVM VEHEILPRLR ERFSLPQPPV
CLRLTTFGRS ESDLAQSLDT LQLPPGVTMG YRSSMPIIEL KLTGPASEQQ AMEKLWLDVK
RVAGQSVIFE GTEGLPAQIS RELQNRQFSL TLSEQFTGGL LALQLSRAGA PLLACEVVPS
QEETLAQTAH WITERRANHF AGLALAVSGF ENEHLNFALA TPDGTLALRV RFSTTRYSLA
IRQEVCAMMA LNMLRRWLNG QDIASEHGWI EVVESMTLSV