Gene EcolC_0606 details

Gene Information

Locus tag: EcolC_0606
Symbol:
ID: 6065337
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 648220
End bp: 649308
Gene Length: 1089 bp
Protein Length: 362 aa
Translation table: 11
GC content: 47%
IMG OID: 641600012
Product: CblD family pilus biogenesis initiator protein
Protein accession: YP_001723609
Protein GI: 170018655
COG category:
COG ID:
TIGRFAM ID:
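
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the record itself. It assumes that the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The variable names are illustrative.

```python
# Values copied from the record above.
start_bp = 648220
end_bp = 649308
gene_length_bp = 1089
protein_length_aa = 362

# Assumption: coordinates are 1-based and inclusive,
# so the span covers end - start + 1 bases.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is
# not counted in the reported protein length.
codons = span_bp // 3
expected_protein_aa = codons - 1

print("span (bp):", span_bp)
print("remainder mod 3:", span_bp % 3)
print("expected protein length (aa):", expected_protein_aa)
```

Under these assumptions the span matches the reported gene length and the codon count matches the reported protein length.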


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGAAATC GATTGATTGC GGCGATATTG GGCTTGTGTG GTGCGGTTAC TGGCGTTCAG 
GCAGCTCCTA ACGTGACCAG TGAAATTACG TACGATTTGG CATCTGGCAG AGCGGATTAT
TACTTCTGGA ATGAGGAGCC CCCACCGGAG GTGAGTTACA GTACAACATT TTCATTTTTT
CAATGTAGCT ACCCTGATTC ACAGCAGACT TGTACATCAG CAGGTAATAC TTCTGTCGTG
CAAATTTATC TAACTGAAAA ACGCAGCGGT ATGCGCTGGC CGGTTAAACT GAAAGGGTAT
ATGACAGTTC AGGTGTGGGA GGACGGACCG TGTAAGGGGT GGTACGATAA GAAAAGGCTG
GATGATGGGA CGGGTTATCA ATGTAAAGAT ACGATTAATA ACGTTGGTTA TCTGGCTAAA
ACAAAAGTTT TAACTCTGTA TATTGAGCAA GAAGAAATGA AGAAACTGCC GATTGGCGGT
TTATGGGAAG GGAAAGTTAA ACTCCATTTT AGCTACCCGG CAACAGATTA TCAGGCTGAT
ATTAAGCTTA ATGTTCTCGA CCCCAACCAT ATCGACGTGT TCTTCCCGGA GTTCGCCCAC
GCCACGCCAC GGGTGCAGTT AGACTTGCAT CCAACAGGGA GCGTTAATGG CAGCAACTAC
GCGCAAGATC TGACCATGTT GGACATGTGC TTGTACGATG GTTTTAACGG TAATGCCATC
AGTTATGAAA TCATGCTCAA AGATGAAGGG CGACCCGCCG CAGGGCGCAG AGACGGTGAC
TTCTCTATCT ATCGTCAGGG AGGAACCACC ACCGACGAGG GAGAACGCAT TGATTACCGG
GTCAAAATGT ACAACCCGGA AACCGGTGGG CAAATTGATG TGCGCAATAA TGAAAATATG
GTCTGGAACA GCATTAACCT GAAACGTGTG CGTCCGGTCG TATTGCCAGG GATCCGCTAT
GCCGTAATGT GTGTGCCAAC GCCATTAACG CTGGCAGTAG AAAAATTTAG CGTGATGGAC
AAACAGGCTG GATATTACAT GGGGAAATTG TCGGTAATCT TTACGCCTTC CTTGCCAACC
ATCAATTAA
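
The gene sequence above is printed in groups of 10 bases, so it has to be joined before any per-base statistic such as GC content can be computed. The sketch below shows one way to do that. The helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first line of the sequence is used as a sample, so the printed value describes that sample and not the whole gene.

```python
def clean_sequence(grouped: str) -> str:
    """Join whitespace-separated groups and lines into one uppercase string."""
    return "".join(grouped.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide string."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence above, used as a small sample.
sample = "ATGAGAAATC GATTGATTGC GGCGATATTG GGCTTGTGTG GTGCGGTTAC TGGCGTTCAG"
seq = clean_sequence(sample)
print(len(seq), seq[:3], round(gc_percent(seq), 1))
```

Passing the full gene sequence text to `clean_sequence` in the same way would give the input needed to compare against the GC content reported in the record.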
 
Protein sequence
MRNRLIAAIL GLCGAVTGVQ AAPNVTSEIT YDLASGRADY YFWNEEPPPE VSYSTTFSFF 
QCSYPDSQQT CTSAGNTSVV QIYLTEKRSG MRWPVKLKGY MTVQVWEDGP CKGWYDKKRL
DDGTGYQCKD TINNVGYLAK TKVLTLYIEQ EEMKKLPIGG LWEGKVKLHF SYPATDYQAD
IKLNVLDPNH IDVFFPEFAH ATPRVQLDLH PTGSVNGSNY AQDLTMLDMC LYDGFNGNAI
SYEIMLKDEG RPAAGRRDGD FSIYRQGGTT TDEGERIDYR VKMYNPETGG QIDVRNNENM
VWNSINLKRV RPVVLPGIRY AVMCVPTPLT LAVEKFSVMD KQAGYYMGKL SVIFTPSLPT
IN