Gene EcolC_3966 details

Gene Information

Locus tag: EcolC_3966
Symbol:
ID: 6064501
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 4356794
End bp: 4358380
Gene Length: 1587 bp
Protein Length: 528 aa
Translation table: 11
GC content: 45%
IMG OID: 641603379
Product: EAL domain-containing protein
Protein accession: YP_001726894
Protein GI: 170021940
COG category: [T] Signal transduction mechanisms
COG ID: [COG4943] Predicted signal transduction protein containing sensor and EAL domains
TIGRFAM ID:
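
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. Neither convention is stated in the record. The function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 4356794
end_bp = 4358380
gene_length_bp = 1587
protein_length_aa = 528


def span_length(start: int, end: int) -> int:
    """Length of a closed, 1-based interval [start, end] (assumed convention)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS whose last codon is assumed to be a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_length = span_length(start_bp, end_bp)
computed_protein = expected_protein_length(computed_length)

# Both comparisons hold for this record under the stated assumptions.
print(computed_length == gene_length_bp)
print(computed_protein == protein_length_aa)
```

Under these assumptions the listed gene length (1587 bp) and protein length (528 aa) agree with the start and end coordinates.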


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.0738656
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTCATC GTGCACGACA CCAATTACTG GCGTTGCCGG GCATTATCTT TTTAGTTCTC 
TTTCCCATCA TTCTTTCGCT ATGGATTGCC TTCCTTTGGG CAAAATCAGA AGTGAATAAT
CAGCTCCGAA CCTTTGCTCA ACTGGCACTG GATAAATCCG AGCTGGTCAT TCGCCAGGCA
GATTTAGTGA GCGATGCAGC TGAACGCTAT CAGGGGCAAG TTTGCACTCC AGCCCATCAA
AAGCGAATGT TGAATATTAT TCGTGGCTAT CTTTATATTA ATGAATTGAT CTATGCCCGT
GATAACCATT TTTTATGCTC ATCGCTGATA GCGCCTGTAA ACGGCTATAC GATTGCACCG
GCCGATTATA AGCGTGAACC TAACGTTTCT ATCTATTATT ACCGCGATAC GCCTTTTTTC
TCTGGCTATA AAATGACCTA TATGCAGCGG GGAAATTATG TGGCGGTTAT CAACCCTCTC
TTCTGGAGTG AAGTGATGTC TGATGACCCG ACATTGCAAT GGGGTGTGTA TGATACGGTG
ACGAAAACCT TTTTCTCGTT AAGCAAAGAG GCCTCGGCAG CAACGTTTTC TCCGCTGATT
CATTTGAAGG ATTTAACCGT ACAAAGAAAT GGCTATTTAT ATGCGACAGT TTATTCGACA
AAACGCCCAA TTGCAGCCAT TGTTGCGACT TCATATCAAC GTCTTATAAC CCATTTTTAT
AATCATCTTA TTTTTGCGTT GCCCGCCGGT ATTTTGGGGA GTCTTGTTCT GCTATTACTC
TGGCTACGTA TTCGACAAAA CTATTTATCT CCCAAACGTA AATTGCAACG CGCCCTCGAA
AAACATCAAC TTTGTCTTTA TTACCAGCCA ATAATCGATA TCAAAACAGA AAAATGTATC
GGCGCTGAAG CGTTGTTACG TTGGCCTGGT GAGCAGGGGC AAATAATGAA TCCGGCAGAG
TTTATTCCGC TGGCAGAAAA GGAGGGGATG ATAGAACAGA TAACTGATTA TGTTATTGAT
AATGTCTTCC GCGATCTGGG CGATTACCTG GCAACACATG CAGATCGCTA TGTTTCTATT
AACCTGTCGG CCTCCGATTT TCATACGTCA CGGTTGATAG CGCGAATCAA TCAGAAAACA
GAGCAATACG CGGTGCGTCC GCAGCAAATT AAATTTGAAG TGACTGAGCA TGCATTTCTT
GATGTTGACA AAATGACGCC GATTATTCTG GCTTTCCGCC AGGCAGGTTA CGAAGTGGCA
ATTGATGATT TTGGTATTGG CTACTCTAAC TTGCATAACC TTAAATCATT GAATGTCGAT
ATTTTGAAAA TCGACAAATC GTTTGTTGAA ACGCTGACCA CCCACAAAAC CAGTCATTTG
ATTGCGGAAC ACATCATCGA GCTGGCGCAC AGCCTGGGGT TAAAAACGAT CGCTGAAGGC
GTCGAAACTG AGGAGCAGGT TAACTGGCTG CGCAAACGCG GCGTGCGCTA TTGCCAGGGA
TGGTTCTTTG CGAAGGCGAT GCCGCCGCAG GTGTTTATGC AATGGATGGA GCAATTACCC
GCGCGGGAGT TAACGCGCGG GCAATAA
 
Protein sequence
MSHRARHQLL ALPGIIFLVL FPIILSLWIA FLWAKSEVNN QLRTFAQLAL DKSELVIRQA 
DLVSDAAERY QGQVCTPAHQ KRMLNIIRGY LYINELIYAR DNHFLCSSLI APVNGYTIAP
ADYKREPNVS IYYYRDTPFF SGYKMTYMQR GNYVAVINPL FWSEVMSDDP TLQWGVYDTV
TKTFFSLSKE ASAATFSPLI HLKDLTVQRN GYLYATVYST KRPIAAIVAT SYQRLITHFY
NHLIFALPAG ILGSLVLLLL WLRIRQNYLS PKRKLQRALE KHQLCLYYQP IIDIKTEKCI
GAEALLRWPG EQGQIMNPAE FIPLAEKEGM IEQITDYVID NVFRDLGDYL ATHADRYVSI
NLSASDFHTS RLIARINQKT EQYAVRPQQI KFEVTEHAFL DVDKMTPIIL AFRQAGYEVA
IDDFGIGYSN LHNLKSLNVD ILKIDKSFVE TLTTHKTSHL IAEHIIELAH SLGLKTIAEG
VETEEQVNWL RKRGVRYCQG WFFAKAMPPQ VFMQWMEQLP ARELTRGQ
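
The sequences above are printed in blocks of ten characters separated by spaces, so they need to be joined before any analysis. The following is a minimal sketch of that cleanup together with a GC-fraction calculation. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the gene sequence is used as input here. Pasting in the full gene sequence is left to the reader if the listed GC content is to be checked.

```python
def clean_sequence(blocked: str) -> str:
    """Join whitespace-separated blocks into one uppercase string."""
    return "".join(blocked.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases in the sequence that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in dna if base in "GC")
    return gc_count / len(dna)


# First line of the gene sequence, copied as printed (six blocks of ten bases).
first_line = "ATGAGTCATC GTGCACGACA CCAATTACTG GCGTTGCCGG GCATTATCTT TTTAGTTCTC"

dna = clean_sequence(first_line)
print(len(dna))          # number of bases after removing the block spacing
print(dna[:3])           # first codon of the CDS
print(gc_fraction(dna))  # GC fraction of this fragment only, not the whole gene
```

The same `clean_sequence` helper works for the protein sequence, since it only removes whitespace and normalizes case.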