Gene EcolC_1574 details


Gene Information

Locus tag: EcolC_1574
Symbol:
ID: 6065601
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 1746633
End bp: 1749950
Gene Length: 3318 bp
Protein Length: 1105 aa
Translation table: 11
GC content: 52%
IMG OID: 641600990
Product: putative sensor protein
Protein accession: YP_001724560
Protein GI: 170019606
COG category: [T] Signal transduction mechanisms
COG ID: [COG3447] Predicted integral membrane sensor domain
TIGRFAM ID: [TIGR00229] PAS domain S-box
            [TIGR00254] diguanylate cyclase (GGDEF) domain
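The coordinate and length fields in this record can be cross-checked against one another. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length; the function names are illustrative and not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by a CDS whose last codon is assumed to be a stop."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(1746633, 1749950)
print(length)                     # 3318
print(protein_length_aa(length))  # 1105
```

Under those assumptions both values agree with the Gene Length and Protein Length fields listed for this gene.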


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.115387
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.924629
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCAAAC AATCACAGCA TGTATTAATT GCCCTGCCCC ACCCGCTGCT TCACCTGGTC 
AGTTTAGGTT TAGTCTCGTT TATCTTTACC CTTTTCTCGC TTGAGCTTTC GCAGTTTGGC
ACCCAACTCG CCCCACTGTG GTTCCCGACG TCCATCATGA TGGTGGCGTT TTATCGCCAT
GCCGGGCGCA TGTGGCCGGG AATTGCGCTG AGCTGTTCGC TGGGAAATAT CGCCGCATCC
ATCCTGCTTT TTTCCACCAG CTCGCTGAAC ATGACCTGGA CGACCATCAA TATTGTTGAA
GCCGTGGTCG GGGCAGTGCT GCTACGTAAA TTGCTGCCGT GGTATAACCC CTTGCAAAAT
CTGGCTGACT GGCTGCGTCT GGCACTCGGC AGCGCCATTG TTCCGCCTCT GTTGGGGGGT
GTTCTGGTTG TCCTGCTGAC GCCCGGAGAC GATCCTCTCA GGGCATTTTT GATATGGGTA
CTGTCAGAAT CCATCGGCGC TCTGGCACTG GTGCCGCTGG GATTGTTATT TAAACCACAC
TATCTGCTGC GCCATCGCAA CCCACGGTTG CTTTTTGAGT CGCTGCTCAC GTTAGCCATC
ACACTGACGT TAAGCTGGCT TTCGATGCTG TATCTGCCGT GGCCTTTTAC TTTCATTATT
GTGCTGTTGA TGTGGAGCGC CGTGCGTCTG CCACGAATGG AAGCCTTTTT GATCTTCCTT
ACCACGGTGA TGATGGTGTC GCTGATGATG GCCGCGGATC CCTCCCTGCT TGCTACGCCG
CGTACGTACC TGATGAGCCA TATGCCGTGG CTACCGTTTT TGCTGATCCT GCTGCCCGCC
AACATCATGA CCATGGTGAT GTATGCCTTT CGTGCGGAAC GCAAACACAT TTCCGAAAGC
GAAACCCACT TTCGGAACGC GATGGAATAT TCCGCTATCG GTATGGCGTT AGTGGGCACC
GAGGGACAAT GGCTGCAAAC CAACAAAGCG CTCTGCCAGT TTCTCGGGTA CAGTCAGGAA
GAGCTGCGCG GACTCACCTT TCAGCAACTG ACCTGGCCGG AGGATCTCAA TAAAGATCTC
CAACAGGTTG AAAAGCTGAT AAGCGGTGAA ATAAACACCT ATTCAATGGA AAAACGCTAC
TACAACCGCA ATGGCGATGT TGTCTGGGCG TTGCTTGCCG TCTCACTGGT GCGCCACACG
GATGGCACGC CGCTCTATTT TATCGCTCAG ATTGAAGACA TTAACGAGCT AAAACGCACC
GAACAGGTCA ATCAGCAACT GATGGAGCGC ATCACTCTGG CTAACGAAGC GGGCGGGATT
GGCATCTGGG AGTGGGAGTT GAAGCCGAAT ATTTTTAGCT GGGATAAGCG GATGTTCGAG
CTGTATGAAA TTCCTCCACA TATCAAACCG AACTGGCAGG TGTGGTACGA GTGCGTGCTG
CCGGAAGATC GCCAGCATGC CGAAAAAGTG ATTCGTGATT CGTTGCAATC ACGCTCGCCC
TTTAAACTGG AATTTCGCAT TACCGTAAAA GACGGTATTC GCCATATCCG CGCCCTCGCC
AACCGGGTAC TGAATAAAGA AGGCGAAGTC GAACGTCTCC TCGGCATCAA TATGGATATG
ACCGAAGTGA AACAGCTTAA CGAGGCATTG TTTCAGGAAA AAGAGCGCCT GCACATTACG
CTTGATTCCA TCGGTGAAGC CGTGGTCTGT ATTGATATGG CGATGAAAAT TACCTTTATG
AATCCAGTGG CGGAGAAGAT GAGCGGCTGG ACGCAGGAAG AAGCGTTAGG TGTTCCGCTC
CTGACGGTGT TGCATATTAC TTTTGGCGAC AACGGACCAT TAATGGAGAA CATTTACAGT
GCCGACACCT CACGTTCCGC GATCGAGCAA GATGTGGTGT TGCACTGTCG GAGCGGCGGC
AGTTACGACG TGCATTACAG TATTACGCCG TTAAGTACTC TGGACGGCAG CAATATTGGT
TCGGTTCTGG TGATTCAGGA CGTTACCGAA TCACGCAAAA TGCTGCGCCA GCTGAGCTAC
AGCGCCTCCC ATGATGCACT GACGCATCTC GCCAACCGCG CCAGTTTTGA GAAACAACTG
CGTATCCTGC TGCAAACGGT AAACAGTACA CATCAGCGAC ATGCCCTGGT GTTTATCGAT
CTTGATCGCT TTAAAGCGGT GAATGACAGC GCCGGGCATG CGGCGGGTGA CGCTTTGCTG
CGCGAACTGG CGTCGTTAAT GCTGAGTATG CTGCGTTCCA GCGACGTGCT GGCGCGACTC
GGTGGGGATG AATTTGGTCT GCTGCTACCA GACTGTAATG TTGAAAGCGC GCGTTTTATC
GCTACACGCA TTATCAGTGC CGTGAATGAC TATCACTTTA TATGGGAAGG CCGTGTGCAT
CGGGTAGGTG CCAGTGCCGG GATTACCTTG ATTGATGACA ACAATCATCA GGCGGCAGAA
GTGATGTCGC AGGCTGATAT CGCCTGTTAT GCCTCCAAAA ATGGTGGCCG GGGCCGGGTG
ACGGTTTACG AACCGCAGCA AGCTGCCGCA CATAGCGAAC GGGCGGCGAT GTCGCTTGAT
GAACAGTGGC GGATGATTAA AGAGAATCAG TTGATGATGC TCGCCCACGG TGTCGCTTCG
CCACGGATCC CGGAAGCGCG TAATTTGTGG CTGATTTCAC TTAAGCTCTG GAGTTGCGAA
GGCGAGATTA TTGATGAACA AACATTTCGT CGTAGCTTCA GCGATCCGGC GCTTAGCCAT
GCTCTTGACC GCCGGGTATT CCACGAATTT TTCCAGCAGG CCGCAAAAGC GGTTGCCAGT
AAAGGCATAA GCATCTCCCT CCCCCTTTCC GTTGCCGGTT TGAGTAGCGC CACGCTGGTG
AATGATCTGC TTGAGCAGCT GGAAAATAGC CCTCTACCAC CACGGTTATT ACATCTGATT
ATTCCGGCTG AAGCGATTTT AGATCACGCA GAAAGCGTGC AAAAACTGCG GCTGGCGGGA
TGTCGGATAG TGCTCAGCCA GGTGGGCCGC GATCTGCAAA TCTTCAACTC GCTGAAAGCG
AATATGGCAG ATTACCTGCT ACTTGATGGT GAGTTATGCG CCAACGTGCA GGGTAATTTG
ATGGATGAGA TGCTGATTAC GATTATTCAG GGGCACGCTC AGCGACTCGG GATGAAAACC
ATCGCCGGGC CAGTCGTTTT ACCCTTAGTG ATGGATACGC TTTCTGGCAT CGGCGTCGAT
CTGATTTATG GTGAGGTGAT TGCCGATGCC CAACCGCTGG ATTTGCTGGT GAATAGTAGT
TATTTCGCGA TTAACTGA
 
Protein sequence
MSKQSQHVLI ALPHPLLHLV SLGLVSFIFT LFSLELSQFG TQLAPLWFPT SIMMVAFYRH 
AGRMWPGIAL SCSLGNIAAS ILLFSTSSLN MTWTTINIVE AVVGAVLLRK LLPWYNPLQN
LADWLRLALG SAIVPPLLGG VLVVLLTPGD DPLRAFLIWV LSESIGALAL VPLGLLFKPH
YLLRHRNPRL LFESLLTLAI TLTLSWLSML YLPWPFTFII VLLMWSAVRL PRMEAFLIFL
TTVMMVSLMM AADPSLLATP RTYLMSHMPW LPFLLILLPA NIMTMVMYAF RAERKHISES
ETHFRNAMEY SAIGMALVGT EGQWLQTNKA LCQFLGYSQE ELRGLTFQQL TWPEDLNKDL
QQVEKLISGE INTYSMEKRY YNRNGDVVWA LLAVSLVRHT DGTPLYFIAQ IEDINELKRT
EQVNQQLMER ITLANEAGGI GIWEWELKPN IFSWDKRMFE LYEIPPHIKP NWQVWYECVL
PEDRQHAEKV IRDSLQSRSP FKLEFRITVK DGIRHIRALA NRVLNKEGEV ERLLGINMDM
TEVKQLNEAL FQEKERLHIT LDSIGEAVVC IDMAMKITFM NPVAEKMSGW TQEEALGVPL
LTVLHITFGD NGPLMENIYS ADTSRSAIEQ DVVLHCRSGG SYDVHYSITP LSTLDGSNIG
SVLVIQDVTE SRKMLRQLSY SASHDALTHL ANRASFEKQL RILLQTVNST HQRHALVFID
LDRFKAVNDS AGHAAGDALL RELASLMLSM LRSSDVLARL GGDEFGLLLP DCNVESARFI
ATRIISAVND YHFIWEGRVH RVGASAGITL IDDNNHQAAE VMSQADIACY ASKNGGRGRV
TVYEPQQAAA HSERAAMSLD EQWRMIKENQ LMMLAHGVAS PRIPEARNLW LISLKLWSCE
GEIIDEQTFR RSFSDPALSH ALDRRVFHEF FQQAAKAVAS KGISISLPLS VAGLSSATLV
NDLLEQLENS PLPPRLLHLI IPAEAILDHA ESVQKLRLAG CRIVLSQVGR DLQIFNSLKA
NMADYLLLDG ELCANVQGNL MDEMLITIIQ GHAQRLGMKT IAGPVVLPLV MDTLSGIGVD
LIYGEVIADA QPLDLLVNSS YFAIN
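The sequences above are displayed in space-separated blocks of ten characters. A minimal sketch of how such a block layout might be joined, its GC fraction computed, and the DNA translated is given below. The helper names are hypothetical. The codon table is the standard genetic code, whose codon-to-amino-acid assignments are shared by NCBI translation table 11 (the two differ only in permitted start codons). The code assumes an unambiguous A/C/G/T sequence.

```python
import re

# Standard genetic code in TCAG order; translation table 11 uses the same
# codon-to-amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def join_blocks(text: str) -> str:
    """Remove display spaces and line breaks, then upper-case the result."""
    return re.sub(r"\s+", "", text).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 21 bases and last 18 bases of the gene sequence shown above.
head = join_blocks("ATGAGCAAAC AATCACAGCA T")
tail = join_blocks("TATTTCGCGA TTAACTGA")
print(translate(head))  # MSKQSQH
print(translate(tail))  # YFAIN
```

The two fragments translate to the first seven and last five residues of the protein sequence listed above. Passing the full joined gene sequence to `gc_fraction` would allow the reported GC content to be recomputed.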