Gene EcolC_2186 details


Gene Information

Locus tag: EcolC_2186
Symbol:
ID: 6066536
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 2398017
End bp: 2399312
Gene Length: 1296 bp
Protein Length: 431 aa
Translation table: 11
GC content: 33%
IMG OID: 641601593
Product: hypothetical protein
Protein accession: YP_001725152
Protein GI: 170020198
COG category: [S] Function unknown
COG ID: [COG4886] Leucine-rich repeat (LRR) protein
TIGRFAM ID:
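
The coordinate and length fields above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal illustration, assuming the start and end positions are 1-based and inclusive (which matches the listed gene length) and that the final codon of the CDS is a stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of residues encoded by a CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(2398017, 2399312)
print(length)                     # 1296
print(protein_length_aa(length))  # 431
```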


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.311872
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAACAA TCACTTTAAA CGACAACCAT ATTGCACATT TAAACGCCAA AAACACTACA 
AAACTGGAAT ATTTAAACTT AAGCAATAAC AATTTACTGC CAACCAATGA CATTGATCAA
CTAATATCAT CAAAACATCT TTGGCATGTA TTAGTTAACG GCATCAACAA TGATCCACTT
GCACAAATGC AGTACTGGAC TGCAGTAAGA AATATAATTG ATGACACTAA TGAAGTGACC
ATTGATTTAT CAGGACTTAA TTTAACCACT CAACCACCAG GGCTGCAAAA CTTCACCTCT
ATCAATCTTG ATAATAACCA ACTCACACAT TTTGATGCAA CCAACTACGA TAGACTCGTA
AAGCTAAGTC TGAATAGTAA TACTCTTGAG TCAATAAATT TTCCTCAAGG CAGAAATGTA
AGTATTACAC ATATATCTAT GAATAATAAT GCTCTCAGAA ATATTGATAT AGATAGGCTT
TCATCAGTTA CTTATTTTAG TGCGGCACAT AATCAACTAG AGTTTGTGCA ATTAGAATCT
TGCGAATGGC TGCAATACCT GAATCTCAGC CATAATCAAT TAACTGATAT TGTTGCAGGT
AATAAAAATG AACTCTTACT GCTGGATCTA TCCCATAATA AACTAACAAG TTTACACAAT
GACTTATTTC CCAACTTGAA TACGTTACTT ATTAACAACA ATTTGCTTTC TGAAATTAAA
ATATTCTATA GCAACTTCTG CAATGTTCAG ACATTAAACG CTGCTAACAA CCAGTTGAAA
TATATAAATC TTGATTTCCT GACTTATCTT CCATCTATCA AAAGTTTAAG ACTGGACAAT
AATAAAATAA CCCACATTGA TACTAATAAT ACATCCGATA TTGGAACTTT ATTCCCCATA
ATAAAACAGA GCAAAAACTT AAATTTTTTA AATGTTTCTG GGAAGAACAA TTGCCCTACT
ATGCAGCTCA TGTTATTTAA TTTATTTTCC CCAGCACTTA AGCTTAATAC TGGCCTGGCA
ATTCTTTCGC CTGGTGCATT TGAAGTTCAC TCTGACGGAA TAGATGTGGA TAACGAATTG
TTTCACTATC CTATTAAAAA AGCATATACC CCATATAATA TACACACTTA CAAGACAGAG
GAAGTTGTAA ACCAGAGGAA TATAAAAGTT AAAAACATGA CCTTAGATGA AATAAACAAT
ACTTACTGTA ATAACGATTA TTACAATCAG GCAATAAGAG AGGAACCGAT AGACCTTCTG
GACAGATCGT TTTCCTCCAG TTCATGGCCT TTTTAG
 
Protein sequence
MKTITLNDNH IAHLNAKNTT KLEYLNLSNN NLLPTNDIDQ LISSKHLWHV LVNGINNDPL 
AQMQYWTAVR NIIDDTNEVT IDLSGLNLTT QPPGLQNFTS INLDNNQLTH FDATNYDRLV
KLSLNSNTLE SINFPQGRNV SITHISMNNN ALRNIDIDRL SSVTYFSAAH NQLEFVQLES
CEWLQYLNLS HNQLTDIVAG NKNELLLLDL SHNKLTSLHN DLFPNLNTLL INNNLLSEIK
IFYSNFCNVQ TLNAANNQLK YINLDFLTYL PSIKSLRLDN NKITHIDTNN TSDIGTLFPI
IKQSKNLNFL NVSGKNNCPT MQLMLFNLFS PALKLNTGLA ILSPGAFEVH SDGIDVDNEL
FHYPIKKAYT PYNIHTYKTE EVVNQRNIKV KNMTLDEINN TYCNNDYYNQ AIREEPIDLL
DRSFSSSSWP F
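
The sequences above are printed in blocks of 10 characters, so they need to be cleaned before any computation. The sketch below shows one way to strip the block formatting, compute a GC percentage, and run a basic sanity check on a CDS. The function names and the toy input are hypothetical, and the check accepts only ATG as a start codon for simplicity, although translation table 11 also permits alternative starts such as GTG and TTG.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """True if the length is a multiple of 3, with an ATG start and a stop codon."""
    return (
        len(seq) % 3 == 0
        and seq.startswith("ATG")
        and seq[-3:] in STOP_CODONS
    )


# Toy input (not the gene above), formatted the same way as this page.
toy = clean_sequence("ATGAAAACAA TCACTTTT\nTAG")
print(len(toy), looks_like_complete_cds(toy), round(gc_percent(toy), 1))
```

To apply this to the record, paste the full gene sequence into `clean_sequence` and compare the results with the Gene Length and GC content fields.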