Gene EcSMS35_1983 details

Gene Information

Locus tag: EcSMS35_1983
Symbol:
ID: 6143381
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 2002860
End bp: 2004320
Gene Length: 1461 bp
Protein Length: 486 aa
Translation table: 11
GC content: 45%
IMG OID: 641616859
Product: cyclic diguanylate phosphodiesterase domain-containing protein
Protein accession: YP_001744035
Protein GI: 170683384
COG category: [T] Signal transduction mechanisms
COG ID: [COG2200] FOG: EAL domain
TIGRFAM ID:
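
The start coordinate, end coordinate, gene length, and protein length listed above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and do not belong to any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of residues encoded by a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(2002860, 2004320)
print(length)                     # 1461
print(protein_length_aa(length))  # 486
```

Both values agree with the Gene Length and Protein Length fields of this record.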


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.123377
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 55
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGAACATCC AGCTCTGGTA TTCCGCAAAA GCAGAGTACC TGGCAGGAGC GAGATATGCC 
GCCAACAATA TCAATCATAT ACTTGAAGAA GCGTCACAGG CGACTCAAAC AGCGGTTAAC
ATTGCCGGGA AGGAATGCGA CCTCGAGGAG CAATATCAGC TTGGCACTGA AGCAGCTCTG
AAACCTCACC TGCGCACAAT CATCATTCTC AAACAGGGAA CGGTCTGGTG TACATCCTTA
CCGGGGAATC GGGTCCTGCT GTCTCGTATT CCTGTTTTCC CGGACAGTAA TTTACTGTTG
GCTCCGGCAA TCGACACCGT TAATAGATTA CCTATCCTGC TCTATCAGAG CCAATTTGCA
GATACGCGCA TTTTGGTTAC GATAAGTGAT CAGCATATTC GCGGGGCGCT TAATGTACCC
TTGAAAGGGG TAAGGTATGT ATTACGCGTG GCGGATGACA TTATTGGACC TACGGGTGAT
GTGATGACGC TTAATGGACA TTATCCCTAT ACCGAGAAGG TTCACTCCAC AAAATATCAT
TTCACTATTA TCTTTAACCC GCCACCACTC TTTAGCTTCT ACAGACTTAT CGATAAAGGC
TTTGGGATAT TGATATTTAT TCTGTTAATC GCCTGCGCCG CTGCTTTGCT GCTTGATAGG
TATTTCAATA AAAGCGCAAC GCCTGAAGAG ATCCTGCGAC GGGCTATAAA TAATGGGGAG
ATCGTCCCTT TTTACCAACC TGTGGTAAAT GGTCGGGAAG GGACATTGCG GGGAGTTGAG
GTGTTAGCCC GCTGGAAACA ACCTCACGGT GGATATATAT CACCCGCGGC ATTTATTCCA
CTTGCTGAAA AATCCGGATT AATCGTTCCG CTTACGCAAA GCCTGATGAA TCAGGTTGCC
AGACAGATGA ACGCTATCTC GAGTAAACTG CCGGAGGGTT TTCATATTGG AATTAATTTT
AGCGCCTCGC ATATTATTTC GCCGACGTTT GTCGACGAGT GCTTAAATTA CCGTGACAGT
TTTACCCGCC GCGATTTAAA CCTCGTTCTG GAAGTCACCG AGCGTGAGCC ATTAAATGTT
GATGAAAGTC TGGTTCAGCG GTTGAACATT CTGCATGAAA ATGGTTTTGT CATCGCGCTG
GATGATTTCG GTACTGGCTA CTCAGGGCTT TCTTATCTGC ATGACCTGCA TATTGATTAT
ATCAAAATTG ATCACAGTTT CGTTGGCCGC GTCAACGCAG ACCCAGAATC AACCCGAATT
CTGGATTGTG TATTGGATCT GGCGCGTAAA CTTTCGATCA GTATCGTCGC TGAAGGTGTC
GAAACGAAAG AACAACTTGA TTATCTGAAC CAAAATAATA TCACATTTCA GCAGGGTTAT
TATTTCTATA AACCTGTTAC ATACATCGAC CTGGTCAAAA TTATCCTTTC TAAACCGAAG
GTGAAGGTTG TGGTTGAGTG A
 
Protein sequence
MNIQLWYSAK AEYLAGARYA ANNINHILEE ASQATQTAVN IAGKECDLEE QYQLGTEAAL 
KPHLRTIIIL KQGTVWCTSL PGNRVLLSRI PVFPDSNLLL APAIDTVNRL PILLYQSQFA
DTRILVTISD QHIRGALNVP LKGVRYVLRV ADDIIGPTGD VMTLNGHYPY TEKVHSTKYH
FTIIFNPPPL FSFYRLIDKG FGILIFILLI ACAAALLLDR YFNKSATPEE ILRRAINNGE
IVPFYQPVVN GREGTLRGVE VLARWKQPHG GYISPAAFIP LAEKSGLIVP LTQSLMNQVA
RQMNAISSKL PEGFHIGINF SASHIISPTF VDECLNYRDS FTRRDLNLVL EVTEREPLNV
DESLVQRLNI LHENGFVIAL DDFGTGYSGL SYLHDLHIDY IKIDHSFVGR VNADPESTRI
LDCVLDLARK LSISIVAEGV ETKEQLDYLN QNNITFQQGY YFYKPVTYID LVKIILSKPK
VKVVVE
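
The gene sequence begins with TTG while the protein begins with M, which is what translation table 11 allows for an alternative start codon. The sketch below is a hedged illustration in Python: it builds the codon table by hand, reads a start codon at position 0 as methionine, and translates the first 60 nucleotides of the gene sequence. The name `translate_cds` and the three-codon `START_CODONS` set are assumptions made for this example; table 11 lists further initiators that are omitted here.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (shared by tables 1 and 11).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Assumed subset of table 11 initiators, enough for this record.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, reading an initiator codon at position 0 as Met."""
    dna = dna.replace(" ", "").upper()
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            aa = "M"
        else:
            aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from this record.
FIRST_60_NT = "TTGAACATCC AGCTCTGGTA TTCCGCAAAA GCAGAGTACC TGGCAGGAGC GAGATATGCC"
print(translate_cds(FIRST_60_NT))  # MNIQLWYSAKAEYLAGARYA
```

The output matches the first 20 residues of the protein sequence listed above.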