Gene SeD_A3991 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A3991 
Symbol 
ID6875494 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp3831489 
End bp3833495 
Gene Length2007 bp 
Protein Length668 aa 
Translation table11 
GC content55% 
IMG OID642786947 
Productputative phosphodiesterase 
Protein accessionYP_002217575 
Protein GI198242458 
COG category[T] Signal transduction mechanisms 
COG ID[COG2200] FOG: EAL domain 
TIGRFAM ID[TIGR00254] diguanylate cyclase (GGDEF) domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones54 
Fosmid unclonability p-value0.185818 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCGCGTCA GCCGCTCGTT AACAATTAAA CAGATGGCAA TGGTTGCGGC CGTTGTCATG 
GTGTTTGTTT TTGTCTTTTG CACCGTTTTG TTGTTCCATC TGGTACAGCA GAACCGCTAC
AACACGGCTA CGCAACTGGA AAGCATCGCG CGATCTGTTC GGGAACCTCT TTCTTCCGCG
ATTTTAAAAG CGGATCTCCC CGGCGCGGAA ACCATTCTGG AAAGTATTAA ACCTGCGGGC
GTGGTGAGTC GCGCCGATGT GGTATTGCCG AATCAGTTCC AGGCGTTGCG TAAGCGCTTC
ATTCCTGAAC GCCCCGTCCC GGTTATGGTG ACGCGTCTCT TCGAACTGCC GGTACAAATT
TCTTTGCCGG TTTATTCGCT GGAGCGTCCC GCCAATCCGC AACCGCTGGC CTACCTTGTA
TTGCAGGCGG ATTCGTACCG TATGTACAAG TTCGTCATGA GCGCGCTCTC TACGTTAGTG
ACCATTTACT TACTTTTATC GCTGATCCTG ACGGTGGCCA TCGCCTGGTG CGTAAACCGC
CTGATTGTGC ATCCGCTGCG CAAAATCGCC CGCGAGCTGA ACGACATTCC GCAGCAGGAG
CTGATCGGAC ATCAGCTGGC GTTGCCGCGT CTGCATCAGG ATGATGAAAT TGGAATGCTG
GTGCGCAGCT ATAACCTCAA CCAGCAGCTT ATGCAGCGTC AACGCGAGGA GCAAACGGAC
AACGCGATGC GTTTTCCGGT TTCCGAGCTG CCCAATAAAG CCTTTTTAAT GGCATTGCTG
GAACAGGTTA TCGCCCGCCA ACAGACCACC GCGCTTATCA TCGTGACGTG CGAAACGTTG
CGTGACACGG CAGGCGTGCT GCAAGAAACG CAGCGGGAGA TTCTATTACT GACGCTGGTT
GAGAAGCTGA AGTCGGTGCT GGCGCCGCGC ATGGTGCTTA CGCAGGTCAG CGGGTATGAC
TTCGCCATTA TCGCCCACGG CGTTAAAGAG CCGTGGCACG CCATCACATT AGGTCAGCAA
ATACTCACTA TCATTAATGA ACGACTGCCC ATCCAGGGTA TTCAACTGCG CCCAAGCTGC
AGTATTGGCA TTGCGATGTA TTATGGCGAT CTGACCGCCG AAGCGCTCTA TGGTCGCGCC
GTTTCCGCCG CGTTTACCGC GCGCCGAAAA GGTAAAAATC AGATCCAGTT CTTTGACCCG
GCGCAGATGG AGGCCGCTCA ACAGCGCCTT ACCGAAGAGA GCGATATCCT TACCGCGCTG
GATAACCATC AGTTTGCCAT TTGGTTGCAG CCGCAGGTCG AGATGCGCAG CGGCAACGTA
TTAAGCGCCG AAGCCTTGTT ACGTATGCAA CAGCCGGACG GTAGCTGGGA ATTGCCGGAG
GGGCTGATTG AGCGCATTGA ATCCTGCGGC CTGATGGTCA CGGTGGGCCA TTGGGTGCTG
GAAGAGTCCT GCCGCCAGCT TGCCGCCTGG CAGGAGCGCG GCGTGACATT GCCGCTCTCC
GTCAATCTTT CCGCGTTACA GCTCATGCAC CCTGGCATGG TGTCGGATCT GCTGGAATTG
TTAAATCGCT ATCGTATTCA ACCGGGTACG CTGATTCTTG AGGTCACTGA AAGCCGCCGT
ATCGACGATC CGCACGCTGC CGTCGCTATC TTACGTCCGT TACGTAATGC TGGCGTGCGT
ATCGCACTGG ATGATTTTGG CATGGGTTAC GCGGGGCTGC GCCAGTTACA GCATATGAAG
TCGCTACCGG TCGATATCCT TAAAATTGAT AAAATGTTTG TCGATGGGTT ACCGGATGAT
CACAGTATGG TGACGGCGAT TATTCTGATG GCCCGCAGTC TTAATTTACA ATTGATTGCC
GAGGGCGTGG AGAACGAGGC GCAACGCGCG TGGCTGGAAC AGGCGGGAGT CAACGTCGCG
CAAGGCTTCC TGTTTGCTCG GCCCGTTCCC GCGGATATCT TTGAAGAACG GTATCTGTCG
CACGAAAATT CTGATTACAA AAGTTAA
 
Protein sequence
MRVSRSLTIK QMAMVAAVVM VFVFVFCTVL LFHLVQQNRY NTATQLESIA RSVREPLSSA 
ILKADLPGAE TILESIKPAG VVSRADVVLP NQFQALRKRF IPERPVPVMV TRLFELPVQI
SLPVYSLERP ANPQPLAYLV LQADSYRMYK FVMSALSTLV TIYLLLSLIL TVAIAWCVNR
LIVHPLRKIA RELNDIPQQE LIGHQLALPR LHQDDEIGML VRSYNLNQQL MQRQREEQTD
NAMRFPVSEL PNKAFLMALL EQVIARQQTT ALIIVTCETL RDTAGVLQET QREILLLTLV
EKLKSVLAPR MVLTQVSGYD FAIIAHGVKE PWHAITLGQQ ILTIINERLP IQGIQLRPSC
SIGIAMYYGD LTAEALYGRA VSAAFTARRK GKNQIQFFDP AQMEAAQQRL TEESDILTAL
DNHQFAIWLQ PQVEMRSGNV LSAEALLRMQ QPDGSWELPE GLIERIESCG LMVTVGHWVL
EESCRQLAAW QERGVTLPLS VNLSALQLMH PGMVSDLLEL LNRYRIQPGT LILEVTESRR
IDDPHAAVAI LRPLRNAGVR IALDDFGMGY AGLRQLQHMK SLPVDILKID KMFVDGLPDD
HSMVTAIILM ARSLNLQLIA EGVENEAQRA WLEQAGVNVA QGFLFARPVP ADIFEERYLS
HENSDYKS