Gene EcSMS35_1397 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_1397 
Symbol 
ID6142933 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp1381570 
End bp1382595 
Gene Length1026 bp 
Protein Length341 aa 
Translation table11 
GC content49% 
IMG OID641616275 
ProductGAF domain/diguanylate cyclase domain-containing protein 
Protein accessionYP_001743455 
Protein GI170679654 
COG category[T] Signal transduction mechanisms 
COG ID[COG2199] FOG: GGDEF domain 
TIGRFAM ID[TIGR00254] diguanylate cyclase (GGDEF) domain 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.210623 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones51 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCAGATC AGATTATCGC CCGCGTCTCG CAATCCCTTG CCAAAGAACA GTCACTGGAA 
AGCCTGGTCC GACAGCTTCT GGAGATGCTG GAAATGGTCA CTGATATGGA ATCAACCTAC
CTGACCAAAG TGGATGTCGA AGCGCGCCTG CAGCATATAA TGTTTGCCCG TAACAGCCAG
AAAATGCACA TCCCGGAGAA TTTTACCGTC TCGTGGGATT ATTCGTTATG CAAACGCGCC
ATTGATGAAA ACTGCTTTTT CAGCGATGAA GTCCCCGACC GTTGGGGCGA CTGTATTGCG
GCACGCAATC TTGGCATCAC CACATTTCTG AGCACGCCAA TTCACTTACC GGATGGATCA
TTCTATGGCA CGCTTTGCGC CGCCAGCAGT GAGAAGCGCC AGTGGAGTGA ACGCGCGGAA
CAGGTTTTGC AGTTATTCGC CGGACTGATT GCACAATATA TTCAAAAAGA GGCGCTGGTA
GAACAGCTGC GCGAAGCCAA TGCCGCACTG ATAGCGCAAT CGTATACCGA CTCGTTAACC
GGGCTACCGA ATCGGCGGGC GATTTTTGAA AATCTGACGA CGCTGTTTTC TCTCGCCCGG
CATCTTAACC ATAAGATAAT GATCGCGTTT ATCGACCTGG ATAACTTCAA ATTAATCAAC
GATCGTTTTG GTCATAATAG TGGCGATCTG TTTCTCATTC AGGTTGGCGA GCGCCTTAAT
ACGCTCCAGC AAAATGGCGA AGTTATTGGT CGTCTCGGCG GTGATGAGTT TTTGGTTGTT
TCACTGAACA ACGAGAATAC GGATATTTCG TCGCTGCGAG AACGTATTCA ACAGCAAATA
CGTGGAGAAT ATCACTTAGG TGATGTTGAT TTGTATTATC CCGGTGCCAG TCTTGGCATA
GTAGAAGTCG ATCCTGAAAC GACCGATGCA GACAGTGCCC TGCATGCTGC CGATATCGCG
ATGTATCAGG AGAAAAAACA CAAACAGAAA ACACCTTTTG TCGCGCATCC AGCGCTACAT
TCCTGA
 
Protein sequence
MSDQIIARVS QSLAKEQSLE SLVRQLLEML EMVTDMESTY LTKVDVEARL QHIMFARNSQ 
KMHIPENFTV SWDYSLCKRA IDENCFFSDE VPDRWGDCIA ARNLGITTFL STPIHLPDGS
FYGTLCAASS EKRQWSERAE QVLQLFAGLI AQYIQKEALV EQLREANAAL IAQSYTDSLT
GLPNRRAIFE NLTTLFSLAR HLNHKIMIAF IDLDNFKLIN DRFGHNSGDL FLIQVGERLN
TLQQNGEVIG RLGGDEFLVV SLNNENTDIS SLRERIQQQI RGEYHLGDVD LYYPGASLGI
VEVDPETTDA DSALHAADIA MYQEKKHKQK TPFVAHPALH S