Gene SeHA_C3931 details

Gene Information

Locus tag: SeHA_C3931
Symbol:
ID: 6491603
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Heidelberg str. SL476
Kingdom: Bacteria
Replicon accession: NC_011083
Strand:
Start bp: 3804831
End bp: 3806837
Gene Length: 2007 bp
Protein Length: 668 aa
Translation table: 11
GC content: 54%
IMG OID: 642744037
Product: putative phosphodiesterase
Protein accession: YP_002047643
Protein GI: 194449672
COG category: [T] Signal transduction mechanisms
COG ID: [COG2200] FOG: EAL domain
TIGRFAM ID: [TIGR00254] diguanylate cyclase (GGDEF) domain
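
The coordinate and length fields in this table are related by simple arithmetic. The following is a minimal sketch of that check in Python, not part of the original record. It assumes the start and end coordinates are 1-based and inclusive (the record does not state the convention) and that the CDS is unspliced and ends with a single stop codon. The function names `gene_length` and `protein_length` are hypothetical.

```python
# Values copied from the Gene Information table.
START_BP = 3804831
END_BP = 3806837


def gene_length(start, end):
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end - start + 1


def protein_length(gene_len):
    """Length in aa, assuming an unspliced CDS whose last codon is a stop."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 2007 668
```

Under these assumptions the computed values agree with the Gene Length (2007 bp) and Protein Length (668 aa) fields.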


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 92
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGCGCGTCA GCCGCTCGTT AACAATTAAA CAGATGGCAA TGGTTGCGGC CGTTGTCATG 
GTGTTTGTTT TTGTCTTTTG CACCGTTTTG TTGTTCCATC TGGTACAGCA GAACCGCTAC
AACACGGCTA CGCAACTGGA AAGTATCGCG CGATCTGTTC GGGAACCTCT TTCTTCCGCG
ATTCTAAAAG CGGATCTCCC CGGCGCGGAA ACCATTCTGG AAAGTATTAA ACCTGCGGGC
GTGGTGAGTC GCGCCGATGT GGTATTGCCG AATCAGTTCC AGGCGTTGCG TAAGCGCTTC
ATTCCTGAAC GCCCCGTCCC GGTTATGGTG ACGCGTCTCT TCGAACTGCC GGTACAAATT
TCTTTGCCGG TTTATTCGTT GGAGCGTCCT GCCAATCCGC AACCGCTGGC CTACCTTGTA
TTGCAGGCGG ATTCGTACCG TATGTACAAG TTCGTCATGA GCGCGCTCTC TACGTTAGTG
ACCATTTACT TACTTTTATC GCTGATCCTG ACGGTGGCCA TCGCCTGGTG CGTAAACCGC
CTGATTGTGC ATCCGCTGCG CAAAATCGCC CGCGAGCTGA ACGACATTCC GCAGCAGGAG
CTGATCGGAC ATCAGCTGGC GTTGCCGCGT CTGCATCAGG ATGATGAAAT TGGGATGCTG
GTGCGCAGCT ATAACCTCAA CCAGCAGCTT ATGCAGCGTC AACGCGAGGA GCAAACGGAC
AACGCGATGC GTTTTCCGGT TTCCGAGCTG CCCAATAAAG CCTTTTTAAT GGCATTGCTG
GAACAGGTTA TCACCCGCCA ACAGACCACC GCGCTTATCA TCGTGACGTG CGAAACGTTG
CGTGACACGG CAGGCGTGCT GCAAGAAACG CAGCGGGAGA TTCTATTACT GACGCTGGTT
GAGAAGCTGA AGTCGGTGCT GGCGCCGCGC ATGGTGCTTA CGCAGGTCAG CGGGTATGAC
TTTGCCATTA TCGCCCACGG CGTTAAAGAG CCGTGGCACG CCATCACATT AGGTCAGCAA
ATACTCACTA TCATTAATGA ACGACTGCCC ATCCAGGGTA TTCAACTGCG CCCAAGCTGC
AGTATTGGCA TTGCGATGTA TTATGGCGAT CTGACCGCCG AAGAGCTCTA TGGTCGCGCC
GTCTCCGCCG CCTTTACCGC GCGCCGAAAA GGTAAAAATC AGATCCAGTT CTTTGACCCG
GCGCAGATGG AGGCCGCTCA ACAGCGCCTT ACCGAAGAGA GCGATATCCT TACCGCGCTG
GATAACCATC AGTTTGCCAT TTGGCTGCAG CCGCAGGTCG AGATGCGCAG CGGCAACGTA
TTAAGCGCCG AAGCCTTGTT ACGTATGCAA CAGCCGGACG GTAGCTGGGA ATTACCAGAT
GGGCTGATTG AGCGCATTGA ATCCTGCGGC CTGATGGTCA CGGTGGGCTA TTGGGTGCTG
GAAGAGTCCT GCCGCCAGCT TGCCGCCTGG CAGGAGCGCG GCGTGACATT GCCGCTCTCC
GTCAATCTTT CCGCGTTACA GCTCATGCAC CCTGGCATGG TGTCGGAGCT GCTGGAATTG
TTAAACCGCT ATCGTATTCA ACCAGGTACG CTGATTCTTG AGGTCACTGA AAGCCGCCGT
ATCGACGATC CGCACGCTGC CGTCGCTATC TTACGTCCGT TACGTAATGC TGGCGTGCGT
ATCGCACTGG ATGATTTTGG CATGGGTTAC GCGGGGCTGC GCCAGTTACA GCATATGAAG
TCGCTACCGG TCGATATCCT TAAAATTGAT AAAATGTTTG TCGATGGGTT ACCGGATGAT
CACAGTATGG TGACGGCGAT TATTCTGATG GCCCGCAGTC TTAATTTACA ATTGATTGCC
GAGGGCGTGG AGAACGAGGC GCAACGCGCG TGGCTGGAAC AGGCGGGAGT CAACGTCGCG
CAAGGCTTCC TGTTTGCTCG GCCCGTTCCC GCGGATATCT TTGAAGAACG GTATCTGTCG
CACGAAAATC CTGATTACAA AAGTTAA
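
The gene sequence above is laid out in space-separated blocks of 10 bases, so it needs to be cleaned before any calculation. The following is a minimal sketch, in Python, of how the GC content field could be recomputed from such a block layout. The helper names `clean_sequence` and `gc_fraction` are hypothetical and are not part of the original record.

```python
def clean_sequence(raw):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(raw):
    """Fraction of bases that are G or C, ignoring whitespace."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence: six blocks of 10 bases.
first_line = "TTGCGCGTCA GCCGCTCGTT AACAATTAAA CAGATGGCAA TGGTTGCGGC CGTTGTCATG"
print(len(clean_sequence(first_line)))  # 60
print(f"{100 * gc_fraction(first_line):.1f}% GC in the first line")
```

Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field of the record (54%).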
 
Protein sequence
MRVSRSLTIK QMAMVAAVVM VFVFVFCTVL LFHLVQQNRY NTATQLESIA RSVREPLSSA 
ILKADLPGAE TILESIKPAG VVSRADVVLP NQFQALRKRF IPERPVPVMV TRLFELPVQI
SLPVYSLERP ANPQPLAYLV LQADSYRMYK FVMSALSTLV TIYLLLSLIL TVAIAWCVNR
LIVHPLRKIA RELNDIPQQE LIGHQLALPR LHQDDEIGML VRSYNLNQQL MQRQREEQTD
NAMRFPVSEL PNKAFLMALL EQVITRQQTT ALIIVTCETL RDTAGVLQET QREILLLTLV
EKLKSVLAPR MVLTQVSGYD FAIIAHGVKE PWHAITLGQQ ILTIINERLP IQGIQLRPSC
SIGIAMYYGD LTAEELYGRA VSAAFTARRK GKNQIQFFDP AQMEAAQQRL TEESDILTAL
DNHQFAIWLQ PQVEMRSGNV LSAEALLRMQ QPDGSWELPD GLIERIESCG LMVTVGYWVL
EESCRQLAAW QERGVTLPLS VNLSALQLMH PGMVSELLEL LNRYRIQPGT LILEVTESRR
IDDPHAAVAI LRPLRNAGVR IALDDFGMGY AGLRQLQHMK SLPVDILKID KMFVDGLPDD
HSMVTAIILM ARSLNLQLIA EGVENEAQRA WLEQAGVNVA QGFLFARPVP ADIFEERYLS
HENPDYKS
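
The gene sequence begins with TTG rather than ATG, yet the protein begins with M. Under translation table 11, TTG is one of the permitted alternative start codons and is read as methionine when it initiates translation. The following is a minimal sketch in Python of that behavior, not the database's own procedure. The function name `translate_cds` is hypothetical, and the sketch assumes the input is a complete CDS in frame from its first base.

```python
from itertools import product

# Codon assignments shared by the standard code and table 11,
# listed in TCAG order for each of the three codon positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Start codons permitted by translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna):
    """Translate a CDS, reading an initial start codon as M."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # ignore any trailing partial codon
    protein = []
    for i in range(0, usable, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate_cds("TTGCGCGTCA GCCGCTCGTT AACAATTAAA"))  # MRVSRSLTIK
```

The output matches the first ten residues of the protein sequence. A plain standard-code lookup would instead read the leading TTG as L.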