Gene ECH74115_4044 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_4044 
SymbolrelA 
ID6972047 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp3738213 
End bp3740447 
Gene Length2235 bp 
Protein Length744 aa 
Translation table11 
GC content54% 
IMG OID643387806 
ProductGDP/GTP pyrophosphokinase 
Protein accessionYP_002272249 
Protein GI209400075 
COG category[K] Transcription
[T] Signal transduction mechanisms 
COG ID[COG0317] Guanosine polyphosphate pyrophosphohydrolases/synthetases 
TIGRFAM ID[TIGR00691] (p)ppGpp synthetase, RelA/SpoT family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000409177 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones46 
Fosmid unclonability p-value0.332873 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTTGCGG TAAGAAGTGC ACATATCAAT AAGGCTGGTG AATTTGATCC GGAAAAATGG 
ATCGCAAGTC TGGGTATTAC CAGCCAGAAG TCGTGTGAGT GCTTAGCCGA AACCTGGGCG
TATTGTCTGC AACAGACGCA GGGGCATCCG GATGCCAGTC TGTTATTGTG GCGTGGTGTT
GAGATGGTGG AGATCCTCTC GACATTAAGT ATGGACATTG ACACGCTGCG GGCGGCGCTG
CTGTTCCCTC TGGCTGATGC CAACGTAGTC AGCGAAGATG TGCTGCGTGA GAGCGTCGGT
AAGTCGGTCG TTAACCTTAT TCACGGCGTG CGTGATATGG CGGCGATCCG CCAGCTGAAA
GCGACGCACA CTGATTCTGT TTCCTCCGAA CAGGTCGATA ACGTTCGCCG GATGTTATTG
GCGATGGTCG ATGATTTTCG CTGCGTGGTC ATCAAACTGG CGGAGCGTAT TGCTCATCTG
CGTGAAGTAA AAGATGCGCC GGAAGATGAA CGCGTACTGG CGGCAAAAGA GTGCACCAAT
ATCTACGCGC CGTTGGCAAA CCGTCTTGGG ATTGGGCAAC TGAAATGGGA GCTGGAAGAT
TACTGCTTCC GTTATCTCCA CCCGACCGAA TACAAACGCA TCGCAAAACT GTTGCATGAA
CGCCGTCTCG ACCGCGAACA CTATATCGAA GAGTTTGTCG GTCATCTGCG CGCTGAGATG
AAAGCTGAAG GCGTTAAAGC TGAAGTGTAT GGTCGTCCGA AACACATCTA CAGCATCTGG
CGCAAAATGC AGAAAAAGAA CCTCGCCTTC GATGAGCTGT TTGATGTGCG TGCGGTACGT
ATTGTCGCCG AGCGTTTACA GGATTGTTAT GCCGCACTGG GGATAGTGCA CACTCACTAT
CGCCACCTGC CGGATGAGTT TGACGATTAC GTCGCTAACC CGAAACCAAA CGGTTATCAG
TCTATTCATA CCGTGGTTCT GGGGCCGGGT GGAAAAACCG TTGAGATCCA AATCCGCACC
AAACAGATGC ATGAAGATGC AGAGTTGGGT GTTGCTGCGC ACTGGAAATA TAAAGAGGGC
GCGGCTGCTG GCGGCGCACG TTCGGGACAT GAAGACCGGA TTGCCTGGCT GCGTAAACTG
ATTGCGTGGC AGGAAGAGAT GGCTGATTCC GGCGAAATGC TCGACGAAGT ACGCAGCCAG
GTCTTTGACG ACCGGGTGTA CGTCTTTACG CCTAAAGGTG ATGTCGTTGA TTTGCCTGCG
GGATCAACGC CGCTGGACTT CGCTTACCAC ATCCACAGTG ATGTCGGACA CCGCTGTATC
GGGGCAAAAA TTGGCGGGCG CATTGTGCCG TTCACCTACC AGCTGCAAAT GGGCGACCAG
ATTGAAATTA TCACCCAGAA ACAGCCGAAC CCCAGCCGTG ACTGGTTAAA CCCAAACCTC
GGTTACGTCA CAACCAGCCG TGGGCGTTCG AAAATTCACG CCTGGTTCCG TAAACAGGAC
CGTGACAAAA ACATTCTGGC TGGGCGGCAA ATCCTTGACG ACGAGCTGGA ACATCTGGGG
ATCAGCCTGA AAGAAGCAGA AAAACATCTG CTGCCGCGTT ACAACTTCAA TGATGTCGAC
GAGTTGCTGG CGGCGATTGG TGGCGGGGAT ATCCGTCTCA ATCAGATGGT GAACTTCCTG
CAATCGCAAT TTAATAAGCC GAGTGCCGAA GAGCAGGACG CCGCCGCGCT GAAACAGCTT
CAGCAAAAAA GCTACACGCC GCAAAACCGC AGTAAAGATA ACGGTCGTGT AGTGGTTGAA
GGTGTTGGTA ACCTGATGCA CCACATCGCG CGCTGCTGCC AGCCGATTCC TGGAGATGAG
ATTGTCGGCT TCATTACCCA GGGACGCGGT ATTTCAGTAC ACCGCGCCGA TTGCGAACAA
CTGGCGGAAC TGCGCTCCCA TGCGCCAGAA CGCATTGTTG ACGCGGTATG GGGTGAGAGC
TACTCCGCCG GATATTCGCT GGTGGTCCGC GTGGTGGCTA ATGATCGTAG TGGGTTGTTA
CGTGATATCA CGACCATTCT CGCCAACGAG AAGGTGAACG TGCTTGGCGT TGCCAGCCGT
AGCGACACCA AACAGCAACT GGCGACCATC GACATGACCA TTGAGATTTA CAACCTGCAA
GTGCTGGGGC GCGTGCTGGG TAAACTCAAC CAGGTGCCGG ATGTTATCGA CGCGCGTCGG
TTGCACGGGA GTTAG
 
Protein sequence
MVAVRSAHIN KAGEFDPEKW IASLGITSQK SCECLAETWA YCLQQTQGHP DASLLLWRGV 
EMVEILSTLS MDIDTLRAAL LFPLADANVV SEDVLRESVG KSVVNLIHGV RDMAAIRQLK
ATHTDSVSSE QVDNVRRMLL AMVDDFRCVV IKLAERIAHL REVKDAPEDE RVLAAKECTN
IYAPLANRLG IGQLKWELED YCFRYLHPTE YKRIAKLLHE RRLDREHYIE EFVGHLRAEM
KAEGVKAEVY GRPKHIYSIW RKMQKKNLAF DELFDVRAVR IVAERLQDCY AALGIVHTHY
RHLPDEFDDY VANPKPNGYQ SIHTVVLGPG GKTVEIQIRT KQMHEDAELG VAAHWKYKEG
AAAGGARSGH EDRIAWLRKL IAWQEEMADS GEMLDEVRSQ VFDDRVYVFT PKGDVVDLPA
GSTPLDFAYH IHSDVGHRCI GAKIGGRIVP FTYQLQMGDQ IEIITQKQPN PSRDWLNPNL
GYVTTSRGRS KIHAWFRKQD RDKNILAGRQ ILDDELEHLG ISLKEAEKHL LPRYNFNDVD
ELLAAIGGGD IRLNQMVNFL QSQFNKPSAE EQDAAALKQL QQKSYTPQNR SKDNGRVVVE
GVGNLMHHIA RCCQPIPGDE IVGFITQGRG ISVHRADCEQ LAELRSHAPE RIVDAVWGES
YSAGYSLVVR VVANDRSGLL RDITTILANE KVNVLGVASR SDTKQQLATI DMTIEIYNLQ
VLGRVLGKLN QVPDVIDARR LHGS