Gene EcE24377A_4427 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcE24377A_4427 
Symbol 
ID5588925 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli E24377A 
KingdomBacteria 
Replicon accessionNC_009801 
Strand
Start bp4415989 
End bp4417737 
Gene Length1749 bp 
Protein Length582 aa 
Translation table11 
GC content50% 
IMG OID640928042 
Productputative frv operon regulatory protein 
Protein accessionYP_001465386 
Protein GI157155910 
COG category[K] Transcription 
COG ID[COG3711] Transcriptional antiterminator 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones35 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTAAACG AACGCCAGTT AAAGATTGTC GATCTGCTGG AGCAACAGCC GCGCACGCCT 
GGAGAGCTGG CGCAACAGAC TGGCGTTTCA GGCAGGACCA TCCTGCGTGA TATTGACTAT
CTCAACTTCA CCCTTAACGG CAAAGCCCGC ATTTTCGCCA GTGGTAGTGC GGGCTATCAG
CTGGAAATCT TCGAGCGCCG CAGCTTTTTT CAGTTGCTGC AAAAGCACGA TAACGACGAT
CGGCTGCTGG CGCTATTATT ACTAAATACC TTTACTCCCC GTGCGCAACT CGCCTCGGCG
CTTAATTTGC CAGAAACGTG GGTAGCAGAA CGTCTGCCCC GGTTAAAACA GCGTTATGAA
CGCACTTGTT GCCTGGCCAG TCGCCCCGGT TTGGGCCATT TCATTGATGA AACAGAAGAG
AAGCGCGTTA TCTTGCTGGC GAACTTGCTA CGCAAAGATC CGTTTTTAAT TCCGCTGGCG
GGCATAACAC GAGACAACCT TCAGCATTTA TCCACGGCCT GCGACAACCA ACACCGCTGG
CCGCTGATGC AGGGTGATTA TCTCTCCAGC CTGATTCTGG CGATTTACGC CCTGCGTAAT
CAGCTTACTG ACATATGGCC GCAATATCCC GGTAACGAGA TAAAACAAAT CGTTGAACAG
TGCGGCCTGT TTCTCGGTGA TAACGCTGTA AGAACCCTGA CGGGTTTGAT AGAGAAACAG
CATCAGCAAG CGCAGGTAAT TTCAGCCGAT CATGTGCTGG GGTTGCTGCA AAGGGTGCCA
GGCATCGCGT CATTGAATAT TATTGATACG CAGTTGGTTG AGAATATTAC CGGGCATTTA
TTACGTTGCC TTGCCGCACC GGTGTGGATT GCTGAGCACC GCCAAAGCAG CATGAATAAC
CTGAAAGCCG CCTGGCCTGC GGCCTTTGAT ATGAGTCTGC ACTTTATTAC GCTACTGCGT
GAACAGCTCG ATATTCCCCT TTTCGACAGC GATCTGATCG GTTTGTATTT TGCCTGTGCG
CTGGAACGGC ATCAAAACGA ACGCCAGCCG ATCATTTTGC TCTCGGACCA GAACGCGATT
GCCACTATTA ATCAGCTCGC CATTGAGCGC GATGTTTTAA ATTGTCGGGT AATTATTGCC
CGTAGCTTAA GCGAACTTGT TGCCATTCGC GAAGAGATTG AGCCGTTATT GATCATTAAC
AATAGCCATT ATTTACTGGA AGACGCGGTA AATAATTACA TCACCGTAAA AAATATCATT
ACGGCTGCCG GTATCGAACA AATAAAACAT TTTCTGGCGA CGGCATTTAT TCGCCAGCAA
CCGGAGCGTT TTTTCTCTGC CCCCGGAAGT TTTCATTATT CGAATGTACG CGGTGAAAGC
TGGCAACATA TTACCCGGCA AATTTGTGCG CAATTAGTGG CACAACACCA TATTACCGCC
GATGAAGCAC AACGCATCAT CGCCCGCGAA GGCGAAGGTG AAAACCTGAT TGTTAATCGC
CTCGCCATCC CACATTGCTG GAGCGAACAG GAGCGACGTT TTCGTGGATT TTTTATTACC
CTCGCCCACC CGATCGAGGT GAATAACGAA ATCATTAACC ATGTGTTGAT TGCCTGCGCC
GCCGCCGATG CGCGTCATGA GCTAAAAATA TTTAGCTATC TGGCAAGCGT ATTGTGTCAG
CATCCGGCAG AGGTTATTGC CGGGTTAACA GGATATGAGG CATTTATGGA GTTACTTCAC
AAGGGGTGA
 
Protein sequence
MLNERQLKIV DLLEQQPRTP GELAQQTGVS GRTILRDIDY LNFTLNGKAR IFASGSAGYQ 
LEIFERRSFF QLLQKHDNDD RLLALLLLNT FTPRAQLASA LNLPETWVAE RLPRLKQRYE
RTCCLASRPG LGHFIDETEE KRVILLANLL RKDPFLIPLA GITRDNLQHL STACDNQHRW
PLMQGDYLSS LILAIYALRN QLTDIWPQYP GNEIKQIVEQ CGLFLGDNAV RTLTGLIEKQ
HQQAQVISAD HVLGLLQRVP GIASLNIIDT QLVENITGHL LRCLAAPVWI AEHRQSSMNN
LKAAWPAAFD MSLHFITLLR EQLDIPLFDS DLIGLYFACA LERHQNERQP IILLSDQNAI
ATINQLAIER DVLNCRVIIA RSLSELVAIR EEIEPLLIIN NSHYLLEDAV NNYITVKNII
TAAGIEQIKH FLATAFIRQQ PERFFSAPGS FHYSNVRGES WQHITRQICA QLVAQHHITA
DEAQRIIARE GEGENLIVNR LAIPHCWSEQ ERRFRGFFIT LAHPIEVNNE IINHVLIACA
AADARHELKI FSYLASVLCQ HPAEVIAGLT GYEAFMELLH KG