Gene EcHS_A4125 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A4125 
Symbol 
ID5593412 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp4116294 
End bp4118048 
Gene Length1755 bp 
Protein Length584 aa 
Translation table11 
GC content50% 
IMG OID640923228 
Productputative frv operon regulatory protein 
Protein accessionYP_001460687 
Protein GI157163369 
COG category[K] Transcription 
COG ID[COG3711] Transcriptional antiterminator 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones47 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTAAACG AACGCCAGTT AAAGATTGTC GATCTGCTGG AGCAACAGCC GCGCACGCCT 
GGCGAGCTGG CGCAACAGAC TGGCGTTTCA GGCAGGACCA TCCTGCGTGA TATTGACTAT
CTCAACTTCA CCCTTAACGG CAAAGCCCGC ATTTTCGCCA GTGGCAGTGC GGGCTATCAG
CTGGAAATCT TCGAGCGCCG CAGCTTTTTT CAGTTGCTGC AAAAGCACGA TAACGACGAT
CGGCTGCTGG CGCTGTTATT ACTGAATACT TTCACTCCCC GTGCGCAACT CGCCTCGGCG
CTTAATTTGC CAGAAACGTG GGTAGCAGAG CGTCTGCCCC GGTTAAAACA GCGTTATGAA
CGCACTTGTT GCCTGGCCAG CCGCCCTGGT TTGGGCCATT TCATTGATGA GACAGAAGAG
AAACGCGTTA TCTTGCTGGC GAACTTGCTG CGCAAAGATC CGTTTTTAAT TCCGCTGGCG
GGCATAACAC GAGACAACCT TCAGCATTTA TCCACGGCCT GCGACAACCA ACACCGCTGG
CCGCTCATGC AGGGTGATTA TCTCTCCAGC CTGATTCTGG CGATTTACGC CCTGCGTAAT
CAACTGACCG ATGAGTGGCC GCAATATCCC GGTGACGAGA TAAAACAAAT CGTTGAACAA
AGCGGCATGT TTCTTGGTGA TAACGCTGTA AGAACCCTGA CAGGTTTGAT AGAGAAACAG
CATCAGCAAG CGCAGATAAT TTCTGCCGAT AATGTGCAGC GGTTGCTGCA AAGGGTGCCG
GGCATCGCGT CATTGAATAT TATTGATACG CGGCTGGTTG AGAACATTAC CGGGCATTTA
TTACGTTGCC TTGCCGCACC AGTGTGGATT GCTGAGCACC GCCAAAGCAG CATGAATAAC
CTGAAAGCAG CCTGGCCTGC GGCCTTTGAT ATGAGTCTGC ACTTTATTAC GCTACTGCGT
GAACAGCTCG ATATTCCCCT TTTCGACAGC GATCTGCTCG GTTTGTATTT TGCCTGTGCG
CTGGAGCGGC ATCAAAACGA ACGCCAGCCG ATCATTTTGC TCTCGGACCA GAACGCGATT
GCCACCATTA ATCAGCTCGC CATTGAGCGC GATGTTTTAA ATTGTCGGGT GATTATTGCC
CGTAGCTTAA GCGAGCTTGT TGCCATTCGC GAAGAGATTG AGCCGTTATT GATCATTAAC
AACAGCCATT ATTTACTGGA TGACGCGGTA AATAATTACA TCACCGTAAA AAATATCATT
ACGGCTGCCG GTATCGAACA AATAAAACAT TTTCTGGCGA CGGCATTTAT TCGCCAGCAA
CCGGAGCGTT TTTTCTCTGC CCCCGGAAGT TTTCATTATT CGAATGTACG CGGTGAAAGC
TGGCAACATA TTACCCGGCA AATTTGTGCG CAATTAGTCG CACAACACCA TATTACCGCC
GATGAAGCAC AACGCATCAT CGCCCGCGAA GGCGAAGGTG AAGGTGAAAA CCTGATTGTT
AATCGCCTCG CCATCCCACA TTGCTGGAGC GAACAGGAGC GACGTTTTCG TGGATTTTTT
ATTACCCTCG CCCAACCAGT TGAGGTGAAT AACGAAGTCA TTAACCATGT CTTGATCGCC
TGCGCCGCCG CCGATGCGCG TCATGAGCTA AAAATATTTA GCTATCTGGC AAGCATATTG
TGTCAGCATC CGGCGGAGAT TATTGCCGGG TTAACAGGAT ATGAGGCATT TATGGAGTTA
CTTCACAAGG GGTGA
 
Protein sequence
MLNERQLKIV DLLEQQPRTP GELAQQTGVS GRTILRDIDY LNFTLNGKAR IFASGSAGYQ 
LEIFERRSFF QLLQKHDNDD RLLALLLLNT FTPRAQLASA LNLPETWVAE RLPRLKQRYE
RTCCLASRPG LGHFIDETEE KRVILLANLL RKDPFLIPLA GITRDNLQHL STACDNQHRW
PLMQGDYLSS LILAIYALRN QLTDEWPQYP GDEIKQIVEQ SGMFLGDNAV RTLTGLIEKQ
HQQAQIISAD NVQRLLQRVP GIASLNIIDT RLVENITGHL LRCLAAPVWI AEHRQSSMNN
LKAAWPAAFD MSLHFITLLR EQLDIPLFDS DLLGLYFACA LERHQNERQP IILLSDQNAI
ATINQLAIER DVLNCRVIIA RSLSELVAIR EEIEPLLIIN NSHYLLDDAV NNYITVKNII
TAAGIEQIKH FLATAFIRQQ PERFFSAPGS FHYSNVRGES WQHITRQICA QLVAQHHITA
DEAQRIIARE GEGEGENLIV NRLAIPHCWS EQERRFRGFF ITLAQPVEVN NEVINHVLIA
CAAADARHEL KIFSYLASIL CQHPAEIIAG LTGYEAFMEL LHKG