Gene EcSMS35_1986 details

Gene Information

Locus tag: EcSMS35_1986
Symbol:
ID: 6143267
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 2006236
End bp: 2007447
Gene Length: 1212 bp
Protein Length: 403 aa
Translation table: 11
GC content: 43%
IMG OID: 641616862
Product: BLUF/cyclic diguanylate phosphodiesterase domain-containing protein
Protein accession: YP_001744038
Protein GI: 170680510
COG category: [T] Signal transduction mechanisms
COG ID: [COG2200] FOG: EAL domain
TIGRFAM ID:
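
The coordinates and lengths listed above agree with each other, and the arithmetic can be sketched as follows. The function names are illustrative and not part of the database. The sketch assumes that the start and end positions are 1-based and inclusive, and that the reported protein length excludes the stop codon.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumption)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of residues encoded by a CDS, not counting the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(2006236, 2007447)
print(length)                     # 1212
print(protein_length_aa(length))  # 403
```

Both results match the Gene Length and Protein Length fields of this record.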


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.758889
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 58
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTTACCA CCCTTATTTA TCGTAGCCAT ATACGTGACG ACGAACCTGT CAAAAAAATC 
GAAGAAATGG TTTCGATAGC AAATCGCAGG AACATGCAGT CTGACGTAAC AGGGATCTTA
CTGTTTAATG GTTCTCATTT TTTCCAGCTT CTGGAAGGTC CGGAAGAACA GGTTAAAATG
ATATATCGGG CTATATGCCA GGATCCACGG CACTATAATA TTGTTGAGCT GATGTGCGAT
TACGCGCCTG CTCGCCGTTT TGGCAAAGCG GGAATGGAAT TATTTGATTT GCGCCTGCAC
GAGCGAGATG ACGTTTTACA GGCCGTATTC GACAAAGGCA CATCAAAATT TCAGCTAACT
TATGATGACA GAGCGCTACA ATTTTTTCGT ACTTTTGTCC TTGCAACCGA ACAAGCAACC
TATTTCGAGA TCCCTGCCGA AGACTCCTGG CTTTTTATCG CTGACGGATC TGATAAAGAA
CTTGATTCCT GTACCCTTTC ACCAACTATA AATGACCACT TTGCCTTTCA TCCTATTGTC
GATCCCTTAT CGCGGCGGAT AATCGCTTTT GAAGCCATTG TGCAAAAAAA TGAAGATAGC
CCATCAGCTA TAGCGGTTGG GCAGCGTAAA GACGGGGAAA TCTACAAAGC GGATCTCAAA
AGTAAGGCGC TTGCATTCGC GATGGCACAC GCACTTGAGC TCGGTGATAA AATGATTTCA
ATCAATCTAT TACCTATGAC CCTGGTTAAC GAACCTGACG CAGTCTCTTT TTTACTTGAT
GAAATAAAGG CCAATGCTCT GGTGCCTGAA CAAATCATCG TTGAATTTAC TGAAAGTGAA
GTCATATCTC GGTTTGATGA GTTTGCAGAA GCGATTAAAT CGCTTAAGGC TGCCGGTATC
AGTGTAGCAA TTGATCATTT TGGCGCAGGT TTTGCCGGTT TGTTACTCCT GTCACGCTTC
CAGCCTGACA GAATTAAAAT CAGTCAGGAA TTGATTACCA ATGTTCATAA AAGCGGGCCA
CGGCAGGCAA TTATTCAAGC GATCATAAAA TGCTGTACAT CACTTGAAAT TCAAGTCTCT
GCTATGGGTG TGACAACACC AGAAGAGTGG ATGTGGCTTG AATCTGCAGG AATTGAGATG
TTTCAGGGAG ATCTGTTTGC GAAAGCTAAA TTGAATGGTA TCCCTTCAGT TGCGTGGCCG
GAGAAAAAAT AA
 
Protein sequence
MLTTLIYRSH IRDDEPVKKI EEMVSIANRR NMQSDVTGIL LFNGSHFFQL LEGPEEQVKM 
IYRAICQDPR HYNIVELMCD YAPARRFGKA GMELFDLRLH ERDDVLQAVF DKGTSKFQLT
YDDRALQFFR TFVLATEQAT YFEIPAEDSW LFIADGSDKE LDSCTLSPTI NDHFAFHPIV
DPLSRRIIAF EAIVQKNEDS PSAIAVGQRK DGEIYKADLK SKALAFAMAH ALELGDKMIS
INLLPMTLVN EPDAVSFLLD EIKANALVPE QIIVEFTESE VISRFDEFAE AIKSLKAAGI
SVAIDHFGAG FAGLLLLSRF QPDRIKISQE LITNVHKSGP RQAIIQAIIK CCTSLEIQVS
AMGVTTPEEW MWLESAGIEM FQGDLFAKAK LNGIPSVAWP EKK
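
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal sketch in plain Python. The helper names (`clean`, `gc_percent`, `translate`) are hypothetical and are not taken from the database. The codon table is built from the amino acid assignments of NCBI translation table 11, which are the same as those of the standard code. The sketch does not handle alternative start codons, which is sufficient here because this gene begins with ATG.

```python
BASES = "TCAG"
# Amino acid assignments for translation table 11, in TCAG codon order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - 2, 3):
        residue = CODON_TABLE[s[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First line of the gene sequence shown above.
first_line = "ATGCTTACCA CCCTTATTTA TCGTAGCCAT ATACGTGACG ACGAACCTGT CAAAAAAATC"
print(translate(first_line))  # MLTTLIYRSHIRDDEPVKKI
```

The printed residues match the first 20 residues of the protein sequence. Passing the full gene sequence to `gc_percent` would give a value to compare against the GC content field of this record.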