Gene ECH74115_2535 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_2535 
Symbol 
ID6971732 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp2399448 
End bp2401358 
Gene Length1911 bp 
Protein Length636 aa 
Translation table11 
GC content55% 
IMG OID643386403 
Producthypothetical protein 
Protein accessionYP_002270885 
Protein GI209396774 
COG category[K] Transcription
[L] Replication, recombination and repair 
COG ID[COG1199] Rad3-related DNA helicases 
TIGRFAM ID[TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.50615 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones55 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACGGACG ATTTTGCACC AGACGGTCAG CTGGCGAAAG CGATACCAGG CTTTAAGCCG 
CGAGAACCAC AGCGACAGAT GGCGGTAGCC GTCACCCAGG CGATAGAAAA AGGCCAGCCG
CTGGTGGTGG AAGCAGGAAC CGGTACGGGC AAAACCTACG CTTACCTGGC CCCTGCGCTG
CGGGCGAAAA AGAAAGTCAT TATCTCGACC GGCTCAAAAG CGTTGCAGGA TCAGCTCTAT
AGCCGCGATT TGCCGACGGT CTCAAAGGCG TTGAAATACA CGGGCAACGT GGCGTTGCTG
AAAGGGCGCT CAAACTACCT CTGCCTCGAA CGTCTCGAAC AGCAGGCGCT GGCGGGGGGC
GATCTTCCGG TACAAATATT AAGCGATGTG ATCCTGCTGC GCTCCTGGTC TAATCAAACA
GTCGATGGTG ATATCAGCAC CTGCGTCAGC GTGGCGGAAG ATTCGCAGGC GTGGCCGCTG
GTCACCAGCA CCAACGACAA CTGTCTTGGC AGCGACTGCC CGATGTATAA AGATTGCTTT
GTGGTCAAAG CACGTAAAAA AGCGATGGAC GCCGATGTGG TGGTGGTAAA CCATCATCTC
TTTCTGGCGG ATATGGTGGT TAAAGAGAGT GGATTTGGCG AGTTGATCCC GGAAGCGGAC
GTCATGATCT TCGACGAAGC CCACCAGCTA CCGGACATTG CCAGCCAGTA TTTTGGTCAG
TCACTCTCCA GTCGACAACT GCTCGACCTG GCAAAAGACA TCACCATCGC CTACCGCACC
GAATTAAAAG ACACCCAGCA GTTACAAAAG TGCGCTGATC GTCTTGCCCA GAGTGCGCAG
GATTTTCGTC TGCAACTCGG TGAACCAGGT TATCGCGGTA ACCTGCGTGA GCTGTTAGCT
AATCCGCAAA TTCAGCGGGC ATTTTTACTG CTCGATGACA CCCTGGAACT TTGTTATGAC
GTGGCGAAAC TGTCACTGGG GCGTTCCGCC TTGCTGGATG CGGCATTTGA GCGCGCCACG
TTGTATCGCA CACGGCTGAA GCGGCTAAAA GAGATCAATC AGCCGGGCTA CAGCTACTGG
TACGAATGCA CTTCGCGCCA TTTTACTCTG GCTCTCACGC CGCTCAGCGT GGCGGATAAA
TTCAAAGAGT TAATGGCGCA AAAGCCCGGT AGCTGGATCT TCACCTCAGC AACGCTGTCG
GTAAACGACG ATCTGCATCA TTTCACCTCG CGGCTTGGGA TAGAACAGGC GGAGTCGTTG
CTATTGCCAA GCCCGTTTGA TTACAGCCGC CAGGCGTTAC TCTGTGTGCC GCGCAACCTG
CCGCAAACCA ATCAACCGGG TTCCGCGCGG CAACTGGCGG CTATGCTGCG CCCGATCATC
GAAGCTAACA ACGGTCGCTG TTTTATGCTT TGTACCTCGC ACGCCATGAT GCGCGATCTG
GCCGAGCAGT TCCGCGCTAC CATGACGCTT CCCGTATTAT TGCAGGGGGA AACCAGCAAA
GGGCAACTGT TGCAGCAATT TGTCAGTGCC GGTAACGCGC TTCTTGTGGC AACCAGTAGT
TTCTGGGAGG GGGTGGACGT GCGTGGCGAT ACATTGTCAT TGGTGATTAT CGACAAATTG
CCGTTTACCT CGCCTGATGA TCCATTGTTA AAAGCGCGCA TGGAAGATTG CCGTTTGCGC
GGTGGCGACC CGTTCGATGA AGTACAACTG CCGGATGCGG TGATTACTCT CAAGCAGGGA
GTCGGGCGAC TGATTCGCGA CGCCGACGAT CGCGGGGTTT TGGTAATTTG TGACAATCGG
CTGGTGATGC GTCCTTACGG CGCGACGTTT CTCGCCAGTC TGCCGCCCGC GCCACGCACC
CGTGACATTG CCCGTGCGGT TCGTTTCCTT GCGATACCAT CCTCCAGGTA A
 
Protein sequence
MTDDFAPDGQ LAKAIPGFKP REPQRQMAVA VTQAIEKGQP LVVEAGTGTG KTYAYLAPAL 
RAKKKVIIST GSKALQDQLY SRDLPTVSKA LKYTGNVALL KGRSNYLCLE RLEQQALAGG
DLPVQILSDV ILLRSWSNQT VDGDISTCVS VAEDSQAWPL VTSTNDNCLG SDCPMYKDCF
VVKARKKAMD ADVVVVNHHL FLADMVVKES GFGELIPEAD VMIFDEAHQL PDIASQYFGQ
SLSSRQLLDL AKDITIAYRT ELKDTQQLQK CADRLAQSAQ DFRLQLGEPG YRGNLRELLA
NPQIQRAFLL LDDTLELCYD VAKLSLGRSA LLDAAFERAT LYRTRLKRLK EINQPGYSYW
YECTSRHFTL ALTPLSVADK FKELMAQKPG SWIFTSATLS VNDDLHHFTS RLGIEQAESL
LLPSPFDYSR QALLCVPRNL PQTNQPGSAR QLAAMLRPII EANNGRCFML CTSHAMMRDL
AEQFRATMTL PVLLQGETSK GQLLQQFVSA GNALLVATSS FWEGVDVRGD TLSLVIIDKL
PFTSPDDPLL KARMEDCRLR GGDPFDEVQL PDAVITLKQG VGRLIRDADD RGVLVICDNR
LVMRPYGATF LASLPPAPRT RDIARAVRFL AIPSSR