Gene EcolC_0178 details


Gene Information

Locus tag: EcolC_0178
Symbol:
ID: 6068230
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 194761
End bp: 196440
Gene Length: 1680 bp
Protein Length: 559 aa
Translation table: 11
GC content: 53%
IMG OID: 641599580
Product: hypothetical protein
Protein accession: YP_001723187
Protein GI: 170018233
COG category:
COG ID:
TIGRFAM ID: [TIGR03368] cellulose synthase operon protein YhjU
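
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based, inclusive coordinates and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and are not part of the database record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(194761, 196440)
print(length_bp)                  # 1680
print(protein_length(length_bp))  # 559
```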


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 38
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTCAAT TTACGCAAAA TACCGCCATG CCTTCTTCCC TCTGGCAATA CTGGCGCGGC 
CTTTCCGGCT GGAACTTCTA TTTTCTGGTT AAGTTCGGCC TGTTGTGGGC GGGATATCTT
AACTTCCATC CGCTCCTCAA TTTGGTGTTT GCCGCGTTTC TGCTGATGCC CCTTCCGCGC
TACAGCCTGC ATCGCTTGCG CCACTGGATT GCCCTGCCGA TCGGCTTTGC TTTGTTCTGG
CATGACACCT GGTTGCCTGG CCCGGAAAGC ATAATGAGCC AGGGTTCGCA GGTGGCGGGG
TTCAATACCG ATTATTTAAT CGACCTTGTC ACACGCTTTA TTAACTGGCA GATGATTGGG
GCCATTTTTG TTTTATTAGT GGCCTGGTTA TTCCTGTCAC AATGGATTCG CATTACCGTT
TTTGTGGTTG CCATACTGCT ATGGCTGAAC GTACTTACCC TGGCGGGACC AAGTTTCTCC
TTGTGGCCAG CCGGACAACC GACGACCACT GTAACAACGA CGGGTGGTAA CGCAGCGGCA
ACCGTTGCGG CGACGGGTGG CGCACCGGTA GTGGGTGATA TGCCCGCACA AACTGCACCG
CCAACAACGG CGAACCTTAA CGCCTGGCTG AATAATTTCT ATAACGCGGA GGCGAAACGT
AAATCGACCT TCCCGTCTTC GCTGCCCGCT GATGCTCAGC CATTTGAACT ACTGGTGATT
AACATCTGTT CGCTTTCCTG GTCGGATATA GAAGCCGCCG GGTTGATGTC GCATCCACTG
TGGTCGCATT TCGATATTGA GTTCAAGAAC TTTAACTCCG CCACCTCCTA CAGTGGCCCG
GCGGCGATCC GTTTACTGCG CGCCAGCTGC GGGCAGACTT CGCACACTAA TCTGTATCAA
CCGGCAAATA ACGACTGCTA TCTGTTTGAT AACCTTTCGA AACTGGGCTT TACCCAGCAC
CTGATGATGG GACATAACGG CCAGTTCGGC GGTTTTTTGA AAGAAGTTCG CGAAAATGGC
GGCATGCAGA GCGAATTGAT GGATCAAACA AATCTGCCGG TTATTTTGCT GGGCTTTGAT
GGTTCGCCGG TTTATGACGA TACCGCTGTG CTTAACCGCT GGCTGGACGT TACCGAAAAA
GATAAAAACA GCCGTAGTGC CACGTTCTAC AACACGCTTC CACTGCATGA CGGCAACCAT
TATCCGGGGG TCAGCAAAAC AGCGGATTAC AAAGCGCGGG CGCAGAAATT CTTTGATGAA
CTGGACGCCT TCTTTACTGA ACTTGAGAAA TCGGGTCGTA AAGTGATGGT GGTCGTGGTG
CCGGAACACG GCGGCGCGCT GAAGGGCGAC AGAATGCAGG TATCTGGCCT ACGTGATATC
CCTAGCCCGT CTATCACCGA CGTCCCCGTT GGGGTGAAAT TCTTCGGCAT GAAGGCACCG
CATCAGGGGG CACCGATTGT CATCGAACAA CCGAGCAGCT TCCTGGCTAT CTCCGATCTG
GTGGTTCGCG TTCTCGATGG CAAGATTTTC ACCGAAGACA ATGTTGACTG GAAAAAACTC
ACCAGTGGGT TGCCACAAAC AGCACCGGTC TCCGAGAACT CAAATGCAGT AGTTATTCAA
TACCAGGATA AACCGTACGT TCGCCTGAAC GGCGGCGACT GGGTGCCTTA CCCGCAGTAA
 
Protein sequence
MTQFTQNTAM PSSLWQYWRG LSGWNFYFLV KFGLLWAGYL NFHPLLNLVF AAFLLMPLPR 
YSLHRLRHWI ALPIGFALFW HDTWLPGPES IMSQGSQVAG FNTDYLIDLV TRFINWQMIG
AIFVLLVAWL FLSQWIRITV FVVAILLWLN VLTLAGPSFS LWPAGQPTTT VTTTGGNAAA
TVAATGGAPV VGDMPAQTAP PTTANLNAWL NNFYNAEAKR KSTFPSSLPA DAQPFELLVI
NICSLSWSDI EAAGLMSHPL WSHFDIEFKN FNSATSYSGP AAIRLLRASC GQTSHTNLYQ
PANNDCYLFD NLSKLGFTQH LMMGHNGQFG GFLKEVRENG GMQSELMDQT NLPVILLGFD
GSPVYDDTAV LNRWLDVTEK DKNSRSATFY NTLPLHDGNH YPGVSKTADY KARAQKFFDE
LDAFFTELEK SGRKVMVVVV PEHGGALKGD RMQVSGLRDI PSPSITDVPV GVKFFGMKAP
HQGAPIVIEQ PSSFLAISDL VVRVLDGKIF TEDNVDWKKL TSGLPQTAPV SENSNAVVIQ
YQDKPYVRLN GGDWVPYPQ
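
The relationship between the gene sequence and the protein sequence can be reproduced with a short script. This is a minimal sketch, not the database's own pipeline: it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two tables differ in which codons may serve as start codons), and it assumes the sequence is pasted in the space-separated blocks of ten shown above. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
# Standard codon assignments in TCAG order; translation table 11 uses the
# same codon-to-amino-acid mapping.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [x + y + z for x in BASES for y in BASES for z in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to format the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at a stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        amino_acid = CODON_TABLE[s[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First row of the gene sequence above.
first_row = "ATGACTCAAT TTACGCAAAA TACCGCCATG CCTTCTTCCC TCTGGCAATA CTGGCGCGGC"
print(translate(first_row))  # MTQFTQNTAMPSSLWQYWRG
```

The printed residues match the first two blocks of the protein sequence. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field in the record.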