Gene ECH74115_4568 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_4568 
Symbol 
ID6966599 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp4238119 
End bp4240059 
Gene Length1941 bp 
Protein Length646 aa 
Translation table11 
GC content51% 
IMG OID643388279 
Productregulatory protein CsrD 
Protein accessionYP_002272714 
Protein GI209396452 
COG category[T] Signal transduction mechanisms 
COG ID[COG2200] FOG: EAL domain 
TIGRFAM ID[TIGR00254] diguanylate cyclase (GGDEF) domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00218723 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones45 
Fosmid unclonability p-value0.217152 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGATTAA CGACGAAATT TTCGGCCTTT GTTACGCTGC TCACCGGGTT AACAATTTTT 
GTGACTTTGC TGGGCTGTTC GCTAAGTTTC TACAACGCCA TTCAGTATAA GTTTAGTCAT
CGCGTTCAGG CGGTGGCGAC GGCGATTGAT ACCCACCTTG TGTCGAATGA CTTCAGCGCA
TTAAGGCCAC AAATTACCGA ATTAATGATG TCGGCAGATA TCGTTCGTGT AGACCTGCTC
CATGGTGATA AGCAGGTTTA TACCCTGGCC AGAAATGGTA GTTATCGTCC GGTTGGCACC
AGCGATCTGT TTCGTGAACT GAGCGTTCCG TTGATAAAGC ATCCGGGGAT GTCGTTGCGT
CTGGTTTATC AGGATCCGAT GGGCAACTAT TTCCATTCGT TGATGACCAC CGCGCCGCTC
ACAGGGGCGA TTGGCTTTAT CATTCTTATG CTCTTTCTGG CGGTACGCTG GTTACAACGG
CAACTTGCCG GGCAAGAATT GCTGGAAACC CGGGCTACTC GTATCTTAAA CGGTGAGCGT
GGCTCTAATG TGTTGGGAAC CATCTATGAA TGGCCGCCCA GAACCAGCAG TGCGCTGGAT
ACGCTGCTTC GTGAAATTCA GAACGCACGC GAACAACACA GCCGTCTTGA TACGCTGATC
CGCTCTTATG CCGCACAGGA CGTGAAAACC GGCCTCAATA ACCGACTCTT TTTCGATAAT
CAGTTAGCAA CGTTACTGGA AGATCAGGAG AAAGTAGGTA CCCACGGGAT CGTGATGATG
ATTCGTCTGC CGGATTTCAA TATGTTGAGC GATACCTGGG GGCACAGCCA GGTTGAAGAA
CAGTTCTTCA CTCTGACGAA TCTGCTGTCG ACATTTATGA TGCGCTACCC TGGCGCACTG
CTGGCGCGTT ACCACCGCAG TGATTTTGCT GCGCTGTTAC CGCACCGGAC GTTAAAAGAG
GCAGAGAGCA TCGCCGGTCA GTTAATCAAA GCCGTCGATA CCTTGCCGAA CAATAAAATG
CTCGATCGCG ACGATATGAT CCACATTGGT ATCTGCGCCT GGCGTAGTGG TCAGGATACC
GAGCAGGTAA TGGAACATGC AGAGTCTGCC ACGCGTAATG CGGGATTGCA GGGCGGCAAT
AGCTGGGCTA TTTACGATGA CTCGTTGCCT GAAAAAGGAC GCGGTAATGT TCGCTGGCGT
ACGCTTATCG AGCAAATGCT CAGTCGCGGC GGCCCGCGCC TTTATCAAAA ACCGGCGGTT
ACTCGCGAAG GTCAGGTTCA TCATCGCGAA CTCATGTGCC GCATCTTTGA TGGTAATGAA
GAGGTTAGCT CGGCGGAGTA TATGCCGATG GTCTTGCAGT TTGGCTTATC GGAAGAGTAT
GACCGTCTGC AAATCAGCCG TCTTATTCCA CTATTGCGTT ACTGGCCAGA GGAAAATCTG
GCGATTCAGG TTACCGTTGA GTCGCTGATT CGCCCGCGTT TTCAGCGTTG GCTGCGCGAT
ACGTTAATGC AATGTGAAAA ATCACAACGA AAACGCATAA TTATTGAACT TGCAGAGGCC
GATGTAGGTC AACATATCAG TCGTTTACAA CCTGTTATTC GTTTAGTGAA TGCTTTAGGG
GTACGGGTAG CCGTCAACCA GGCTGGTTTG ACGCTGGTAA GCACCAGTTG GATCAAAGAA
CTTAATGTTG AGTTACTCAA GCTCCATCCG GGGCTGGTCA GAAACATTGA GAAGCGAACG
GAGAACCAGC TGCTGGTTCA AAGCCTGGTG GAAGCCTGCT CAGGGACCAG CACCCAGGTT
TACGCCACCG GTGTGCGTTC GCGAAGCGAG TGGCAGACCC TGATTCAGCG CGGTGTTACA
GGCGGGCAAG GGGATTTTTT CGCGTCCTCA CAGCCACTTG ATACTAACGT GAAAAAATAT
TCACAAAGAT ACTCGGTTTA A
 
Protein sequence
MRLTTKFSAF VTLLTGLTIF VTLLGCSLSF YNAIQYKFSH RVQAVATAID THLVSNDFSA 
LRPQITELMM SADIVRVDLL HGDKQVYTLA RNGSYRPVGT SDLFRELSVP LIKHPGMSLR
LVYQDPMGNY FHSLMTTAPL TGAIGFIILM LFLAVRWLQR QLAGQELLET RATRILNGER
GSNVLGTIYE WPPRTSSALD TLLREIQNAR EQHSRLDTLI RSYAAQDVKT GLNNRLFFDN
QLATLLEDQE KVGTHGIVMM IRLPDFNMLS DTWGHSQVEE QFFTLTNLLS TFMMRYPGAL
LARYHRSDFA ALLPHRTLKE AESIAGQLIK AVDTLPNNKM LDRDDMIHIG ICAWRSGQDT
EQVMEHAESA TRNAGLQGGN SWAIYDDSLP EKGRGNVRWR TLIEQMLSRG GPRLYQKPAV
TREGQVHHRE LMCRIFDGNE EVSSAEYMPM VLQFGLSEEY DRLQISRLIP LLRYWPEENL
AIQVTVESLI RPRFQRWLRD TLMQCEKSQR KRIIIELAEA DVGQHISRLQ PVIRLVNALG
VRVAVNQAGL TLVSTSWIKE LNVELLKLHP GLVRNIEKRT ENQLLVQSLV EACSGTSTQV
YATGVRSRSE WQTLIQRGVT GGQGDFFASS QPLDTNVKKY SQRYSV