Gene ECH74115_5348 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_5348 
Symbol 
ID6970481 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp4989856 
End bp4991604 
Gene Length1749 bp 
Protein Length582 aa 
Translation table11 
GC content49% 
IMG OID643389004 
Productputative frv operon regulatory protein 
Protein accessionYP_002273413 
Protein GI209397591 
COG category[K] Transcription 
COG ID[COG3711] Transcriptional antiterminator 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones58 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTAAACG AACGCCAGTT AAAGATTGTC GATCTGCTGG AGCAACAGCC GCGCACGCCT 
GGCGAGCTGG CGCAACAGAC TGGCGTTTCA GGCAGGACCA TCCTGCGTGA TATTGACTAT
CTCAACTTTA CCCTTAACGG CAAAGCTCGT ATCTCTGCCA GTGGCAGTGC GGGCTATCAG
TTGGAAATCT TCGAGCGCCG CAGCTTTTTT CAGTTGCTGC AAAAGCACGA TAACGACGAT
CGGCTGCTGG CGCTGTTATT ACTGAATACC TTTACTCCCC GTGCGCAACT CGCCTCGGCG
CTTAATTTGC CAGAAACTTG GGTCGCAGAG CGTCTGCCCC GGTTAAAACA GCGTTATGAA
CGCACTTGTT GCCTGGCCAG TCGTCCCGGT TTGGGCCATT TCATTGATGA AACAGAAGAG
AAACGCGTTA TCTTGCTGGC GAACTTGCTA CGCAAAGATC CGTTTTTAAT ACCTTTGGCT
GGCGTAACAC GAGACAACCT TCAGCATTTA TCCACGGCCT GCGACAACCA ACACCGCTGG
CCGCTCATGC AGGGGGATTA TCTCTCCAGC CTGATTCTGG CGATTTACGC CCTGCGTAAT
CAACTGACCG ATGAGTGGCC GCAATATCCC GGTGACGAGA TAAAACAAAT CGTTGAACAT
AACGGTCTGT TTCTTGGTGA TAACGCTGTA AGAACCCTGA CAGGTTTGAT AGAGAAACAG
CATCAGCAAG CGCAGGTAAT TTCTGCCGAT CATGTGCTGG GGTTGCTGCA AAGGGTGCCA
GGTCTCGCGT CATTGAATAT TATTGATACG CAGCTGGTTG AGAACATTAC CGGTCATTTA
TTACGTTGCC TTGCCGCACC AGTGTGGATT GCTGAGCATC GCCAAAGCAG CATGAATAAC
CTGAAAGCCG CCTGGCCTGC GGCCTTTGAT ATGAGTCTGC ACTTTATTAC GCTACTGCGT
GAACAGCTCG ATATTCCCCT TTTCGACAGC GATCTGATCG GTTTGTATTT TGCCTGTGCG
CTGGAACGAC ATCAAAACGA ACGCCAGCCG ATCATTTTGC TCTCGGACCA GAACGCGATT
GCCACTATTA ATCAGCTCGC CATTGAGCGC GATGTTTTAC ATTGTCGGGT GATTATTGCC
CGTAGCTTAA GCGAACTTGT TGCCATTCGC GAAGAGATTG AGCCGTTATT GATCATTAAC
AACAGCCATT ATTTACTGGA TGACGCGGTA AATAATTACA TCACCGTAAA AAATATCATT
ACGGCTGCCG GTATCGAACA AATAAAACAT TTTCTGGCGA CGGCATTTAT TCGCCAGCAA
CCGGAGCGTT TTTTCTCTGC CCCCGGAAGT TTTCATTATT CGAATGTACG CGGTGAAAGC
TGGCAACATA TTACCCGGCA AATTTGTGCG CAATTAGTCG CACAACACCA TATTACCGCC
GATGAAGCGC AACGGATCAT CGCCCGCGAA GGCGAAGGTG AAAACCTGAT TGTTAATCGC
CTCGCCATCC CACATTGCTG GAGCGAACAG GAGCGACGTT TTCGTGGATT TTTTATTACC
CTCGCCCAAC CAGTTGAGGT GAATAACGAA GTCATTAACC ATGTCTTGAT CGCCTGCGCC
GCCGCCGATG CGCGTCATGA GCTAAAAATA TTTAGCTATC TGGCAAGCGT ATTGTGTCAG
CATCCGGCAG AAGTTATTGC CGGGCTAACA GGATATGAGG CATTTATGGA GTTACTTCAC
AAGGGGTGA
 
Protein sequence
MLNERQLKIV DLLEQQPRTP GELAQQTGVS GRTILRDIDY LNFTLNGKAR ISASGSAGYQ 
LEIFERRSFF QLLQKHDNDD RLLALLLLNT FTPRAQLASA LNLPETWVAE RLPRLKQRYE
RTCCLASRPG LGHFIDETEE KRVILLANLL RKDPFLIPLA GVTRDNLQHL STACDNQHRW
PLMQGDYLSS LILAIYALRN QLTDEWPQYP GDEIKQIVEH NGLFLGDNAV RTLTGLIEKQ
HQQAQVISAD HVLGLLQRVP GLASLNIIDT QLVENITGHL LRCLAAPVWI AEHRQSSMNN
LKAAWPAAFD MSLHFITLLR EQLDIPLFDS DLIGLYFACA LERHQNERQP IILLSDQNAI
ATINQLAIER DVLHCRVIIA RSLSELVAIR EEIEPLLIIN NSHYLLDDAV NNYITVKNII
TAAGIEQIKH FLATAFIRQQ PERFFSAPGS FHYSNVRGES WQHITRQICA QLVAQHHITA
DEAQRIIARE GEGENLIVNR LAIPHCWSEQ ERRFRGFFIT LAQPVEVNNE VINHVLIACA
AADARHELKI FSYLASVLCQ HPAEVIAGLT GYEAFMELLH KG