Gene EcolC_1183 details


Gene Information

Locus tag: EcolC_1183
Symbol:
ID: 6066923
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 1296867
End bp: 1297928
Gene Length: 1062 bp
Protein Length: 353 aa
Translation table: 11
GC content: 54%
IMG OID: 641600599
Product: hypothetical protein
Protein accession: YP_001724177
Protein GI: 170019223
COG category: [R] General function prediction only
COG ID: [COG0628] Predicted permease
TIGRFAM ID:
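
The coordinates and lengths in this record agree with each other, which a little arithmetic confirms. The following is a minimal Python sketch, not part of the original record. It assumes the start and end coordinates are 1-based and inclusive, and that the reported protein length excludes the stop codon; neither convention is stated in the record. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Number of amino acids, assuming the last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


length_bp = gene_length(1296867, 1297928)
print(length_bp)                  # 1062
print(protein_length(length_bp))  # 353
```

Under these assumptions the computed values match the Gene Length (1062 bp) and Protein Length (353 aa) fields.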


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.0538168
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTCGAAA TGTTGATGCA ATGGTATCGC CGCCGTTTTA GCGACCCGGA AGCGATTGCC 
TTGCTGGTTA TTTTAGTTGC CGGATTTGGC ATTATCTTTT TCTTTAGTGG CCTGCTTGCT
CCGTTGCTGG TGGCTATTGT GCTGGCCTAT TTGCTGGAAT GGCCAACTGT GCGCCTGCAA
TCTATTGGCT GCTCCCGCCG CTGGGCGACG TCGATTGTAT TGGTGGTTTT TGTCGGTATA
TTGCTACTGA TGGCGTTCGT GGTACTGCCT ATCGCCTGGC AACAGGGCAT CTACTTAATT
CGCGATATGC CGGGGATGCT CAATAAGCTT TCTGACTTTG CCGCCACGTT GCCGCGCCGC
TATCCGGCGT TAATGGATGC GGGCATTATT GATGCAATGG CCGAAAATAT GCGCAGTCGG
ATGCTGACCA TGGGCGATTC GGTGGTGAAA ATTTCCCTCG CCTCGCTGGT CGGTTTGCTG
ACCATAGCCG TCTATCTGGT GCTGGTGCCA TTGATGGTCT TCTTCCTGCT GAAAGACAAA
GAGCAGATGC TGAACGCCGT TCGTCGGGTG CTGCCGCGCA ACCGTGGACT GGCAGGACAG
GTGTGGAAGG AGATGAATCA ACAAATCACC AACTATATCC GCGGCAAAGT GCTGGAGATG
ATCGTGGTGG GGATCGCCAC CTGGCTGGGG TTCTTGCTCT TTGGGCTGAA CTATTCGCTG
CTGCTGGCGG TGCTGGTCGG CTTCTCGGTT CTTATTCCGT ACATTGGCGC ATTTGTGGTG
ACCATTCCGG TGGTTGGCGT GGCGCTATTC CAGTTTGGTG CAGGCACGGA ATTCTGGAGC
TGCTTCGCAG TGTATCTGAT TATTCAGGCG CTGGACGGCA ACCTGTTAGT ACCGGTGTTG
TTCTCCGAAG CGGTTAACCT GCATCCGCTG GTGATTATTT TATCGGTGGT GATCTTCGGT
GGTTTGTGGG GATTCTGGGG CGTATTCTTC GCCATTCCAT TGGCGACCCT GATCAAAGCC
GTGATCCACG CCTGGCCCGA TGGGCAAATA GCGCAAGAAT AA
 
Protein sequence
MLEMLMQWYR RRFSDPEAIA LLVILVAGFG IIFFFSGLLA PLLVAIVLAY LLEWPTVRLQ 
SIGCSRRWAT SIVLVVFVGI LLLMAFVVLP IAWQQGIYLI RDMPGMLNKL SDFAATLPRR
YPALMDAGII DAMAENMRSR MLTMGDSVVK ISLASLVGLL TIAVYLVLVP LMVFFLLKDK
EQMLNAVRRV LPRNRGLAGQ VWKEMNQQIT NYIRGKVLEM IVVGIATWLG FLLFGLNYSL
LLAVLVGFSV LIPYIGAFVV TIPVVGVALF QFGAGTEFWS CFAVYLIIQA LDGNLLVPVL
FSEAVNLHPL VIILSVVIFG GLWGFWGVFF AIPLATLIKA VIHAWPDGQI AQE
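
The record does not say how its GC content figure was derived. As a hedged illustration, the sketch below assumes the simplest definition: the share of G and C bases among all bases in the gene sequence. It accepts the sequence in the space-separated 10-base blocks used above. The function name `gc_percent` is hypothetical.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First two 10-base blocks of the gene sequence: 8 of 20 bases are G or C.
print(gc_percent("ATGCTCGAAA TGTTGATGCA"))  # 40.0
```

Passing the full gene sequence to the same function gives a value that can be compared with the GC content field.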