Gene EcolC_0878 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_0878 
Symbol 
ID6064784 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp943073 
End bp944104 
Gene Length1032 bp 
Protein Length343 aa 
Translation table11 
GC content52% 
IMG OID641600281 
ProductDNA-binding transcriptional regulator GalR 
Protein accessionYP_001723874 
Protein GI170018920 
COG category[K] Transcription 
COG ID[COG1609] Transcriptional regulators 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGACCA TAAAGGATGT AGCCCGACTG GCAGGCGTTT CAGTCGCCAC CGTTTCCCGC 
GTCATTAATA ATTCACCTAA AGCCAGTGAA GCTTCCCGGC TGGCTGTGCA TAGTGCAATG
GAGTCTCTTA GCTATCACCC GAACGCCAAC GCCCGTGCGC TGGCGCAGCA GACCACTGAA
ACGGTCGGTC TGATCGTTGG TGATGTTTCC GATCCGTTTT TCGGTGCAAT GGTGAAAGCG
GTCGAACAGG TGGCTTATCA CACCGGTAAT TTCTTATTGA TTGGCAACGG TTACCACAAC
GAACAAAAAG AGCGTCAGGC TATTGAGCAA CTGATCCGCC ATCGCTGTGC TGCGTTGGTC
GTCCATGCTA AAATGATCCC GGATGCCGAT TTAGCCTCAT TAATGAAACA AATGCCCGGT
ATGGTGCTGA TCAACCGTAT CCTGCCTGGC TTTGAAAACC GTTGTATTGC TCTTGACGAT
CGTTACGGTG CCTGGCTGGC AACGCGTCAT TTAATTCAGC AAGGTCATAC CCGCATTGGT
TATCTGTGCT CTAACCACTC TATTTCTGAC GCCGAAGATC GTCTGCAAGG GTATTACGAT
GCCCTTGCTG AAAGTGGTAT TGCGGCCAAT GACCGGCTGG TGACATTTGG CGAACCAGAC
GAAAGCGGCG GCGAACAGGC AATGACCGAG CTTTTGGGAC GAGGAAGAAA TTTCACTGCG
GTAGCCTGTT ATAACGATTC AATGGCGGCG GGTGCGATGG GCGTTCTCAA TGATAATGGT
ATTGATGTAC CGGGTGAGAT TTCGTTAATT GGCTTTGATG ATGTGCTGGT GTCACGCTAT
GTGCGTCCGC GCCTGACCAC CGTGCGTTAC CCAATCGTGA CGATGGCGAC CCAGGCTGCC
GAACTGGCTT TGGCGCTGGC GGATAATCGC CCTCTCCCGG AAATCACTAA TGTCTTTAGT
CCGACGCTGG TACGTCGTCA TTCAGTGTCA ACTCCGTCGC TGGAGGCAAG TCATCATGCA
ACCAGCGACT AA
 
Protein sequence
MATIKDVARL AGVSVATVSR VINNSPKASE ASRLAVHSAM ESLSYHPNAN ARALAQQTTE 
TVGLIVGDVS DPFFGAMVKA VEQVAYHTGN FLLIGNGYHN EQKERQAIEQ LIRHRCAALV
VHAKMIPDAD LASLMKQMPG MVLINRILPG FENRCIALDD RYGAWLATRH LIQQGHTRIG
YLCSNHSISD AEDRLQGYYD ALAESGIAAN DRLVTFGEPD ESGGEQAMTE LLGRGRNFTA
VACYNDSMAA GAMGVLNDNG IDVPGEISLI GFDDVLVSRY VRPRLTTVRY PIVTMATQAA
ELALALADNR PLPEITNVFS PTLVRRHSVS TPSLEASHHA TSD