Gene EcolC_4120 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_4120 
Symbol 
ID6066004 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp4545445 
End bp4547193 
Gene Length1749 bp 
Protein Length582 aa 
Translation table11 
GC content50% 
IMG OID641603542 
Productputative frv operon regulatory protein 
Protein accessionYP_001727045 
Protein GI170022091 
COG category[K] Transcription 
COG ID[COG3711] Transcriptional antiterminator 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTAAACG AACGCCAGTT AAAGATTGTC GATCTGCTGG AGCAACAGCC GCGCACGCCT 
GGCGAGCTGG CGCAACAGAC TGGCGTTTCA GGCAGGACCA TCCTGCGTGA TATTGACTAT
CTCAACTTCA CCCTTAACGG CAAAGCCCGC ATTTTCGCCA GTGGCAGTGC GGGCTATCAG
CTGGAAATCT TCGAGCGCCG CAGCTTTTTT CAGTTGCTGC AAAAGCACGA TAACGACGAT
CGGCTGCTGG CGCTGTTATT ACTGAATACT TTCACTCCCC GTGCGCAACT CGCCTCGGCG
CTTAATTTGC CAGAAACGTG GGTAGCAGAG CGTCTGCCCC GGTTAAAACA GCGTTATGAA
CGCACTTGTT GCCTGGCCAG CCGCCCTGGT TTGGGCCATT TCATTGATGA GACAGAAGAG
AAACGCGTTA TCTTGCTGGC GAACTTGCTG CGCAAAGATC CGTTTTTAAT TCCGCTGGCG
GGCATAACAC GAGACAACCT TCAGCATTTA TCCACGGCCT GCGACAACCA ACACCGCTGG
CCGCTCATGC AGGGTGATTA TCTCTCCAGC CTGATTCTGG CGATTTACGC CCTGCGTAAT
CAACTGACCG ATGAGTGGCC GCAATATCCC GGTGACGAGA TAAAACAAAT CGTTGAACAT
AGCGGTCTGT TTCTTGGTGA TAACGCTGTA AGAACCCTGA CGGGTTTGAT AGAGAAACAG
CATCAGCAAG CGCAGGTAAT TTCAGCCGAT AATGTGCAGG GGTTGCTGCA AAGGGTGCCG
GGCATCGCGT CATTGAATAT TATTGATGCG CAGCTGGTTG AGAATATTAC CGGGCATTTA
TTACGTTGCC TTGCCGCACC AGTGTGGATT GCTGAGCACC GCCAGAGCAG CATGAATAAC
CTGAAAGCCG CCTGGCCTGC GGCCTTTGAT ATGAGTCTGC ACTTTATTAC GCTACTGCGT
GAACAGCTCG ATATTCCTCT TTTCGACAGC GATCTGATCG GTTTGTATTT TGCCTGTGCG
CTGGAACGGC ATCAAAACGA ACGCCAGCCG ATCATTTTGC TCTCGGACCA GAACGCGATT
GCCACCATTA ATCAGCTCGC CATTGAGCGC GATGTTTTAC ATTGTCGGGT GATTATTGCC
CGTAGCTTAA GCGAACTTGT TGCCATTCGC GAAGAGATTG AGCCGTTATT GATCATTAAC
AACAGCCATT ATTTACTGGA TGACGCGGTC AATAATTACA TCACCGTAAA AAATATCATT
ACGGCTGCCG GTATCGAACA AATAAAACAT TTTCTGGCGA CGGCATTTAT TCGCCAGCAA
CCGGAGCGTT TTTTCTCTGC CCCCGGAAGT TTTCATTATT CGAATGTACG CGGTGAAAGC
TGGCAACATA TTACCCGGCA AATTTGTGCG CAATTAGTGG CACAACACCA TATTACCGCC
GATGAAGCGC AACGGATCAT CGCCCGCGAA GGCGAAGGTG AAAACCTGAT TGTTAATCGC
CTCGCCATCC CACATTGCTG GAGCGAACAG GAGCGACGTT TTCGTGGATT TTTTATTACC
CTCGCCCAAC CAGTTGAGGT GAATAACGAA GTCATTAACC ATGTCTTGAT CGCCTGCGCC
GCCGCCGATG CGCGTCACGA GTTGAAAATA TTCAGCTATC TGGCAAGCGT ATTGTGTCAG
CATCCGGCAG AGGTTATTGC CGGGTTAACA GGATATGAGG CATTTATGGA GTTACTTCAC
AAGGGGTGA
 
Protein sequence
MLNERQLKIV DLLEQQPRTP GELAQQTGVS GRTILRDIDY LNFTLNGKAR IFASGSAGYQ 
LEIFERRSFF QLLQKHDNDD RLLALLLLNT FTPRAQLASA LNLPETWVAE RLPRLKQRYE
RTCCLASRPG LGHFIDETEE KRVILLANLL RKDPFLIPLA GITRDNLQHL STACDNQHRW
PLMQGDYLSS LILAIYALRN QLTDEWPQYP GDEIKQIVEH SGLFLGDNAV RTLTGLIEKQ
HQQAQVISAD NVQGLLQRVP GIASLNIIDA QLVENITGHL LRCLAAPVWI AEHRQSSMNN
LKAAWPAAFD MSLHFITLLR EQLDIPLFDS DLIGLYFACA LERHQNERQP IILLSDQNAI
ATINQLAIER DVLHCRVIIA RSLSELVAIR EEIEPLLIIN NSHYLLDDAV NNYITVKNII
TAAGIEQIKH FLATAFIRQQ PERFFSAPGS FHYSNVRGES WQHITRQICA QLVAQHHITA
DEAQRIIARE GEGENLIVNR LAIPHCWSEQ ERRFRGFFIT LAQPVEVNNE VINHVLIACA
AADARHELKI FSYLASVLCQ HPAEVIAGLT GYEAFMELLH KG