Gene EcSMS35_1373 details

Gene Information

Locus tag: EcSMS35_1373
Symbol:
ID: 6143788
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 1360068
End bp: 1361666
Gene Length: 1599 bp
Protein Length: 532 aa
Translation table: 11
GC content: 51%
IMG OID: 641616251
Product: cyclic diguanylate phosphodiesterase domain-containing protein
Protein accession: YP_001743431
Protein GI: 170681526
COG category: [T] Signal transduction mechanisms
COG ID: [COG2200] FOG: EAL domain
TIGRFAM ID:
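
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon. Neither assumption is stated in the record, but both are consistent with its numbers. The function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1360068
END_BP = 1361666
GENE_LENGTH_BP = 1599
PROTEIN_LENGTH_AA = 532


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))   # 1599
print(coding_codons(GENE_LENGTH_BP))   # 532
```

Under these assumptions, the span length matches the reported gene length (1599 bp), and 1599 / 3 = 533 codons minus one stop codon gives the reported 532 aa.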


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.295034
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 33
Fosmid unclonability p-value: 0.0183228
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAAAAAG CACAACGGAT CATTAAAACC TATCGCCGTA ATCGAATGAT TGTTTGTACG 
ATTTGCGCCC TCGTTACGCT CGCTTCGACC CTGAGCGTGC GATTTATTTC ACAGCGTAAC
TTAAATCAAC AACGGGTAGT ACAATTCGCC AATCACGCTG TAGAGGAATT AGATAAAGTA
CTGCTTCCCC TACAGGCAGG TAGCGAAGTC TTGCTTCCGC TGATTGGTCT GCCCTGCTCT
GTCGCCCATT TGCCATTACG TAAACAGGCG GCAAAACTCC AAACTGTGCG ATCCATTGGC
CTGGTGCAAG ACGGCACACT TTATTGCTCC AGCATTTTTG GTTATCGCAA TGTGCCCGTC
GTGGACATTC TGGCTGAACT TCCTGCACCG CAACCACTTT TACGCCTGAC GATCGACCGT
GCCCTGATTA AAGGCAGTCC GGTTTTGATT CAGTGGACGC CAGCAGCGGG CAGTAGCAAA
GCTGGGGTCA TGGAGATGAT TAACATCGAC TTACTGGCGG CAATGCTGCT TGAGCCACAA
CTGCCGCAAA TCAGTAGCGC CAGCCTGACG GTGGACGATC GGCATTTGCT CTATGGTAAT
GGGCTGGTAG ATTCCCTTCC GCAACCTGAA AACAATGAAA ACTACCAGGT TTCTTCGCAA
CGCTTTCCTT TTACCATTAA CGTTAATGGT CCGGGGGCTA CGGCGCTGGC ATGGCACTAT
CTTCCAACAC AATTACCGCT GGCGGTGCTG CTAAGTTTAC TGGTGGGCTA CATCGCCTGG
CTGGCGACCG CTTACCGGAT GAGCTTTTCC CGCGAAATCA ATCTGGGCCT GGCGCAACAT
GAGTTCGAAT TGTTCTGTCA GCCTTTGCTT AATGCGCGCA GCCAGCAATG TATTGGTGTA
GAGATTTTGC TTCGCTGGAA CAATCCGCGT CAGGGCTGGA TTTCACCGGA TGTGTTTATT
CCTATCGCGG AAGAACATCA TTTAATTGTG CCACTGACCC GCTATGTGAT GGCAGAAACC
ATTCGTCAGC GCCATGTTTT CCCGATGAGT AGTCAGTTTC ATGTTGGCAT TAACGTCGCA
CCCAGCCATT TTCGCCGTGG TGTGCTGATA AAAGATCTCA ATCAGTACTG GTTTAGCGCT
CACCCGATTC AGCAACTGAT CCTCGAAATC ACCGAACGCG ATGCCTTACT GGATGTTGAT
TATCGGATTG CCCGCGAGCT ACATCGTAAA AACGTCAAAC TGGCGATTGA TGACTTCGGC
ACCGGCAACA GTTCATTTTC CTGGCTTGAA ACATTACGTC CTGACGTGCT GAAAATTGAT
AAGTCATTTA CCGCAGCTAT AGGTTCTGAC GCGGTTAACT CGACGGTGAC CGATATCATC
ATCGCTCTGG GGCAAAGACT GAATATTGAA CTGGTGGCGG AGGGTGTGGA AACACAAGAA
CAGGCGAAGT ATTTGCGCCG TCATGGGGTG CATATTTTGC AAGGGTATTT GTACGCACAG
CCGATGCCGC TACGTGATTT TCCCAAATGG CTGGCGGGCA GCCAACCGCC GCCCGCCCGG
CATAATGGAC ATATCACGCC CGTTATGCCG TTACGTTAA
 
Protein sequence
MQKAQRIIKT YRRNRMIVCT ICALVTLAST LSVRFISQRN LNQQRVVQFA NHAVEELDKV 
LLPLQAGSEV LLPLIGLPCS VAHLPLRKQA AKLQTVRSIG LVQDGTLYCS SIFGYRNVPV
VDILAELPAP QPLLRLTIDR ALIKGSPVLI QWTPAAGSSK AGVMEMINID LLAAMLLEPQ
LPQISSASLT VDDRHLLYGN GLVDSLPQPE NNENYQVSSQ RFPFTINVNG PGATALAWHY
LPTQLPLAVL LSLLVGYIAW LATAYRMSFS REINLGLAQH EFELFCQPLL NARSQQCIGV
EILLRWNNPR QGWISPDVFI PIAEEHHLIV PLTRYVMAET IRQRHVFPMS SQFHVGINVA
PSHFRRGVLI KDLNQYWFSA HPIQQLILEI TERDALLDVD YRIARELHRK NVKLAIDDFG
TGNSSFSWLE TLRPDVLKID KSFTAAIGSD AVNSTVTDII IALGQRLNIE LVAEGVETQE
QAKYLRRHGV HILQGYLYAQ PMPLRDFPKW LAGSQPPPAR HNGHITPVMP LR
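
The relationship between the gene sequence and the protein sequence above can be illustrated with a short sketch. It builds a codon table from the standard amino acid assignments, which translation table 11 shares with the standard code. It then translates the first and last lines of the gene sequence and defines a helper for GC percentage. The names `translate`, `gc_percent` and `clean` are illustrative, not part of any database tooling. The sketch does not handle the alternative start codons that table 11 permits, which is not an issue here because this gene starts with ATG.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
# Translation table 11 uses the same amino acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


# First and last lines of the gene sequence in this record.
first_line_start = "ATGCAAAAAG CACAACGGAT CATTAAAACC"
last_line = "CATAATGGAC ATATCACGCC CGTTATGCCG TTACGTTAA"

print(translate(first_line_start))  # MQKAQRIIKT
print(translate(last_line))         # HNGHITPVMPLR
```

The two printed fragments correspond to the beginning and end of the protein sequence listed above. Passing the full gene sequence to `translate` and `gc_percent` would allow the reported protein length and GC content to be checked in the same way.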