Gene EcSMS35_4287 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_4287 
Symbol 
ID6147311 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp4389292 
End bp4391040 
Gene Length1749 bp 
Protein Length582 aa 
Translation table11 
GC content50% 
IMG OID641619108 
Productputative frv operon regulatory protein 
Protein accessionYP_001746232 
Protein GI170683057 
COG category[K] Transcription 
COG ID[COG3711] Transcriptional antiterminator 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones47 
Fosmid unclonability p-value0.899072 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTAAACG AACGCCAGTT AAAGATTGTC GATCTGCTGG AGCAACAGCC ACGCACGCCT 
GGCGAGCTGG CGCAACAGAC TGGCGTTTCC GGCAGGACCA TCCTGCGTGA TATTGACTAT
CTCAACTTTA CCCTTAACGG CAAAGCCCGC ATTTCCGCCA GTGGCAACGC GGGCTATCAG
CTGGAAATCT TCGAGCGCCG CAGCTTTTTT CAGTTGCTGC AAAAGCACGA TAACGACGAT
CGGCTGCTGG CGCTGTTATT ACTGAATACC TTCACTCCCC GTGCGCAACT CGCCTCGGCG
CTTAATTTGC CAGAAACGTG GGTAGCAGAG CGCCTGCCCC GCCTGAAACA GCGCTATGAA
CGCGGTTTTT GTCTCGCTAG TCGCCCTGGT TTGGGCCATT TCATTGATGA GACAGAAGAG
AAACGCATTA TTTTGCTGGC GAATCTTTTA CGCAAAGATC CGTTTTTAAT TCCGCTGTCG
GGCGTAACAC GAGACAATTT TCAACAATTA ACCACCGCCT GCGAAAAGCA ACGTCGCTGG
CCGCTGATGC AAGGCGACTA TCTCTCAAGC CTGATTCTGG CGATTTACGC CCTGCGTAAT
CAACTGACCG ATGAGTGGCC GCAATATCCC GGTGACGAGA TAAAACAAAT CGTTGAACAG
AGCGGCCTGT TTCTTGGTGA TAACGCTGTA AGAACCCTGA CAGGTTTGAT AGAGAAACAG
CATCAGCAAG CGCAGGTAAT TTCTGCCGAT CATGTGCTGG GGTTGCTGCA AAGGGTGCCA
GGCATCGCGT CATTGAATAT TATTGATACG CAGTTGGTTG AGAATATTAC CGGGCATTTA
TTACGTTGCC TTGCCGCACC GGTGTGGATT GCCGAGCACC GCCAAAGCAG CATGAATAAC
CTGAAAGCCG CCTGGCCTGC GGCCTTTGAT ATGAGTCTGC ACTTTATTAC GCTACTGCGT
GAACAGCTCG ATATTCCCCT TTTCGACAGC GATCTGATCG GTTTGTATTT TGCCTGTGCG
CTGGAACGGC ATCAAAACGA ACGCCAGCCG ATCATTTTGC TCTCGGACCA GAACGCGATT
GCCACCATTA ATCAGCTCGC CATTGAGCGC GATGTTTTAC ATTGTCGGGT GATTATTGCC
CGTAGCTTAA GCGAACTTGT TGCCATTCGC GAAGAGATTG AGCCGTTATT GATCATTAAC
AACAGCCATT ATTTACTGGA TGACGCGGTA AATAATTACA TCACCGTAAA AAATATCATT
ACGGCTGCCG GTATCGAACA AATAAAACAT TTTCTGGCGA CGGCATTTAT TCGCCAGCAA
CCGGAGCGTT TTTTCTCTGC CCCCGGAAGT TTTCATTATT CGAATGTACG CGGTGAAAGC
TGGCAACATA TTACCCGGCA AATTTGTGCT CAATTAGTCG CACAACACCA TATTACCGCC
GATGAAGCGC AACGGATCAT CGCCCGCGAA GGCGAAGGTG AAAACCTGAT TGTTAATCGC
CTCGCCATCC CACATTGCTG GAGCGAACAG GAGCGACGTT TTCGTGGATT TTTTATTACC
CTCGCCCAAC CGGTTGAGGT GAATAACGAA GTAGTTAACC ATGTACTGAT CGCCTGTGCC
GCCGCCGATG CGCGTCATGA GCTAAAAATA TTTAGCTATC TGGCAAGCGT ATTGTGCCAG
CATCCGGCAG AGGTTATTGC CGGGTTAACA GGATATGAGG CATTTATGGA GTTACTTCAC
AAGGGGTGA
 
Protein sequence
MLNERQLKIV DLLEQQPRTP GELAQQTGVS GRTILRDIDY LNFTLNGKAR ISASGNAGYQ 
LEIFERRSFF QLLQKHDNDD RLLALLLLNT FTPRAQLASA LNLPETWVAE RLPRLKQRYE
RGFCLASRPG LGHFIDETEE KRIILLANLL RKDPFLIPLS GVTRDNFQQL TTACEKQRRW
PLMQGDYLSS LILAIYALRN QLTDEWPQYP GDEIKQIVEQ SGLFLGDNAV RTLTGLIEKQ
HQQAQVISAD HVLGLLQRVP GIASLNIIDT QLVENITGHL LRCLAAPVWI AEHRQSSMNN
LKAAWPAAFD MSLHFITLLR EQLDIPLFDS DLIGLYFACA LERHQNERQP IILLSDQNAI
ATINQLAIER DVLHCRVIIA RSLSELVAIR EEIEPLLIIN NSHYLLDDAV NNYITVKNII
TAAGIEQIKH FLATAFIRQQ PERFFSAPGS FHYSNVRGES WQHITRQICA QLVAQHHITA
DEAQRIIARE GEGENLIVNR LAIPHCWSEQ ERRFRGFFIT LAQPVEVNNE VVNHVLIACA
AADARHELKI FSYLASVLCQ HPAEVIAGLT GYEAFMELLH KG