Gene EcSMS35_0415 details


Gene Information

Locus tag: EcSMS35_0415
Symbol: adrA
ID: 6144162
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 426089
End bp: 427204
Gene Length: 1116 bp
Protein Length: 371 aa
Translation table: 11
GC content: 54%
IMG OID: 641615311
Product: diguanylate cyclase AdrA
Protein accession: YP_001742518
Protein GI: 170683904
COG category: [T] Signal transduction mechanisms
COG ID: [COG2199] FOG: GGDEF domain
TIGRFAM ID: [TIGR00254] diguanylate cyclase (GGDEF) domain
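
The start, end, gene length and protein length fields can be cross-checked against each other. The sketch below assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 426089
end_bp = 427204
gene_length_bp = 1116
protein_length_aa = 371


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, not counting the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(start_bp, end_bp) == gene_length_bp)                # True
print(expected_protein_length(gene_length_bp) == protein_length_aa)  # True
```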


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 58
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTCCCAA AAATAATGAA TGATGAAAAC TTTTTCAAAA AAGCGGCGGC GCATGGGGAG 
GAACCTCCTT TAACTCCTCA AAACGAACAT CAGCGATCCG GGCTGCGCTT CGCTCGTCGC
GTCAGACTAC CCCGCGCGGT TGGCCTGGCT GGCATGTTCT TACCGATTGC TTCAACGCTG
GTTTCGCACC CGCCGCCGGG CTGGTGGTGG CTGGTGTTGG TCGGCTGGGC GTTCGTCTGG
CCGCATTTAG CCTGGCAGAT AGCGAGCAGG GCCGTCGATC CGCTTAGCCG GGAAATTTAC
AACTTAAAAA CCGATGCAGT ATTAGCGGGA ATGTGGGTAG GCGTAATGGG CGTAAACGTG
CTGCCTTCCA CCGCGATGTT GATGATTATG TGTCTGAATT TGATGGGGGC AGGCGGCCCC
CGTCTGTTTG TCGCGGGTCT GGTGTTGATG GTGGTTTCCT GCCTTGTCAC CCTCGAACTG
ACGGGCATCA CCGTGTCGTT CAATAGTGCG CCGCTGGAAT GGTGGCTCTC CCTTCCCATT
ATCGTCATTT ATCCTCTGCT GTTTGGCTGG GTCAGCTACC AGACGGCGAC TAAACTGGCG
GAACACAAAC GCAGGTTGCA GGTCATGAGT ACCCGCGACG GTATGACGGG CGTGTATAAC
CGACGTCATT GGGAAACTAT GTTACGCAAT GAATTTGATA ACTGTCGGCG GCATAACCGC
GATGCGACAT TACTGATTAT CGATATCGAC CATTTCAAGA GCATCAACGA TACCTGGGGC
CATGATGTGG GCGATGAAGC GATTGTGGCG CTTACCCGAC AGTTACAAAT AACCCTGCGC
GGTAGCGATG TGATTGGTCG GTTTGGCGGC GATGAGTTTG CGGTAATCAT GTCCGGTACG
CCAGCTGAGA GCGCCATTAC CGCCATGTTA CGAGTGCATG AAGGGCTAAA TACATTACGT
CTGCCGAATA CGCCACAGGT AACTTTACGG ATTAGTGTGG GGGTTGCGCC GCTGAACCCA
CAAATGAGTC ACTATCGTGA GTGGTTGAAA TCGGCAGATT TGGCGCTTTA CAAAGCAAAG
AAAGCCGGAC GTAACCGCAC CGAAGTGGCG GCCTGA
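
A GC content figure like the one listed under Gene Information can be recomputed from a sequence laid out in this format. This is a minimal sketch: `gc_content` is a hypothetical helper, it ignores the spaces and line breaks used in the listing, and it is shown on the first 60 bases only, so its output is not the whole-gene value.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a sequence, ignoring whitespace."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First row of the gene sequence above, used only as an illustration.
first_row = "ATGTTCCCAA AAATAATGAA TGATGAAAAC TTTTTCAAAA AAGCGGCGGC GCATGGGGAG"
print(f"{gc_content(first_row):.1f}% GC in the first 60 bases")
```

Passing the full 1116 bp sequence to the same function would give the whole-gene percentage for comparison with the GC content field.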
 
Protein sequence
MFPKIMNDEN FFKKAAAHGE EPPLTPQNEH QRSGLRFARR VRLPRAVGLA GMFLPIASTL 
VSHPPPGWWW LVLVGWAFVW PHLAWQIASR AVDPLSREIY NLKTDAVLAG MWVGVMGVNV
LPSTAMLMIM CLNLMGAGGP RLFVAGLVLM VVSCLVTLEL TGITVSFNSA PLEWWLSLPI
IVIYPLLFGW VSYQTATKLA EHKRRLQVMS TRDGMTGVYN RRHWETMLRN EFDNCRRHNR
DATLLIIDID HFKSINDTWG HDVGDEAIVA LTRQLQITLR GSDVIGRFGG DEFAVIMSGT
PAESAITAML RVHEGLNTLR LPNTPQVTLR ISVGVAPLNP QMSHYREWLK SADLALYKAK
KAGRNRTEVA A
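
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below builds the codon table by hand rather than using a library. It assumes translation table 11 assigns the same amino acids as the standard code and differs only in its permitted start codons, which does not matter here because the gene starts with ATG. `translate` is an illustrative name.

```python
from itertools import product

# Codons in TCAG order, third position varying fastest, paired with the
# amino acids of the standard code. Table 11 is assumed to share these.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon.

    Whitespace is ignored. A codon with a non-ACGT base raises KeyError.
    """
    bases = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last rows of the gene sequence above.
first_row = "ATGTTCCCAA AAATAATGAA TGATGAAAAC TTTTTCAAAA AAGCGGCGGC GCATGGGGAG"
last_row = "AAAGCCGGAC GTAACCGCAC CGAAGTGGCG GCCTGA"
print(translate(first_row))  # MFPKIMNDENFFKKAAAHGE
print(translate(last_row))   # KAGRNRTEVAA
```

Both outputs match the corresponding ends of the protein sequence listed above.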