Gene EcSMS35_1716 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_1716 
Symbol 
ID6143472 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp1723321 
End bp1724370 
Gene Length1050 bp 
Protein Length349 aa 
Translation table11 
GC content43% 
IMG OID641616592 
Productdiguanylate cyclase 
Protein accessionYP_001743770 
Protein GI170679992 
COG category[T] Signal transduction mechanisms 
COG ID[COG3706] Response regulator containing a CheY-like receiver domain and a GGDEF domain 
TIGRFAM ID[TIGR00254] diguanylate cyclase (GGDEF) domain 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value0.00839353 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGTTATTC GCGGCATGAC GATTAGTTTG CCATGGTTTG CTTTTATCAA TGTCAGTTTT 
GCATTGATAA TTTTGCTGCG CCGTGTTCTG TTTAATGACC TTACTCCACC CTGGTTAAAT
GAAAAAAGCC TTATACATTC TATTGATATA TCTGCAACGG GTATTCTGCT GATTTGCAGT
GGTTTACTAC TCATCCCTCG GCAAAAAACA TTACCTATCC AGGTGCTATT AGTTGCCCTG
AGCCTGCTAT GGTCATGGTG TAGTTATCAC TTTATTGCGT ACTGGACGCT GCAGTTTGCT
TATCCACTAT GTGTATTATT AATGCTTAGT GGCGTTATCG CACTTTATTT CCATACCCCA
TCATTGCTCG CATTTGTCAT TCCCTTATGG TTCACCACTC CAATAGCGAG TCTTATGCTT
AATCAACAGA TCAATATTCA TTTTGCTGTT GTATGGTGTA TTTTTTCACT GGCGCTATAT
GGTGGGCGAC TCATTTTGTT ACGTTGGTTT GAAGAAGCAT GGGTACAAAA CAGTTATAAC
AACCAGTTAA TTAATCGCCT TGATGCATTA GCACATCGTG ATCCTTTAAC TGGGATAGCA
AACCGACGTG CAATGAACAG TATCTTACAC GACGCGATAG ATAACGGCGG GTCATTTGCT
CTTATTATGC TGGATGTCGA TTTTTTTAAA CGTTACAACG ACACTTACGG GCATCCTGCT
GGGGATCGTT GTCTAATACA AGTTGCCGAC GCTCTACAGC GTTCAACTCG CCAGTCGGAG
GATGTTGTTG TCCGTTATGG TGGCGAAGAG TTTGTCATCA TACTTTTTAA TGCAACATTG
CCGGAAGCAG AGACGGTAGC AGCCAGAGTG AAGCAAGAAC TTAAACTGGC AGCTATCCCG
CATCAGGCAT CTGCTGTTAA TGCATTCGTT ACGGTTAGTC AGGGGATTAC CTGTTCTGCT
CCCGCAAAAA CGGCTGAACA AATTATTTCA GATGCCGACG CTGCGTTATA CCGGGCGAAA
GAAAGTGGGC GAAATCGATG GGAAAAATAA
 
Protein sequence
MVIRGMTISL PWFAFINVSF ALIILLRRVL FNDLTPPWLN EKSLIHSIDI SATGILLICS 
GLLLIPRQKT LPIQVLLVAL SLLWSWCSYH FIAYWTLQFA YPLCVLLMLS GVIALYFHTP
SLLAFVIPLW FTTPIASLML NQQINIHFAV VWCIFSLALY GGRLILLRWF EEAWVQNSYN
NQLINRLDAL AHRDPLTGIA NRRAMNSILH DAIDNGGSFA LIMLDVDFFK RYNDTYGHPA
GDRCLIQVAD ALQRSTRQSE DVVVRYGGEE FVIILFNATL PEAETVAARV KQELKLAAIP
HQASAVNAFV TVSQGITCSA PAKTAEQIIS DADAALYRAK ESGRNRWEK