Gene SeD_A2073 details

Gene Information

Locus tag: SeD_A2073
Symbol:
ID: 6875654
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853
Kingdom: Bacteria
Replicon accession: NC_011205
Strand:
Start bp: 2007208
End bp: 2008701
Gene Length: 1494 bp
Protein Length: 497 aa
Translation table: 11
GC content: 45%
IMG OID: 642785186
Product: ggdef domain-containing protein
Protein accession: YP_002215852
Protein GI: 198244426
COG category: [T] Signal transduction mechanisms
COG ID: [COG2199] FOG: GGDEF domain
TIGRFAM ID: [TIGR00254] diguanylate cyclase (GGDEF) domain
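
The start, end, gene length and protein length fields in this record are related by simple arithmetic. The Python sketch below checks that relationship. The function names are illustrative and not part of the database. It assumes that coordinates are 1-based and inclusive, and that the protein length excludes the terminal stop codon.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length(2007208, 2008701)
print(length, protein_length(length))  # prints: 1494 497
```

Both results agree with the Gene Length (1494 bp) and Protein Length (497 aa) fields.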


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.986087
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 36
Fosmid unclonability p-value: 0.000116677
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker

Sequence

Gene sequence
ATGAATTTGC ATCATAAAGC GCTCAGGCAC TTTATCTCGG CAAGCGTCAT CGTTTTGACA 
TCGTCCTTCC TGATTTACGA ACTTATTGCC AGCGACAGGG CAATGAATGC CTATATGCGT
TATATCATGG AGAGAGCCGA TTCGTCGTTT TTGTACGATA AGTACCAGAA TCAGAGCATC
GCCGCACATT TGATGCGAAC TTTTGAAGCG CCGGGAGACC CCGTCACTGC AGAAAAACGC
CGCGCGTTTT GCGACGCCTT TGAGGCTATT AACGGTACGC ACGGCGTCAA CCTGACCCGG
CATAATTATC CGGGGCTACA TGGCACCCTG CAAACCGCCG CTACACAGTG TACCGATAAT
CTTGATGACG CCCTTTTACT GCCTGCATTT GATCAGGCGG TAAGCATCAA CCGTTCGCAG
GACGACCACA GTCACGGGCT GGGCACACTG GAGCTTAAAT TCCGTTATTA CGTTGATTTA
AATAAACATT ATGTCTATTT CTATGATTTA ATCAACTCAC GGCGCTTCGC CATGCATCGT
TGGACTTTTT TACAAAAAGG CACAATGGGT ATTAACAGAA AAGATATAGA TAAACTTTTT
ACCGGCCGTA CGGTTATTTC AAGTATTTAC ATGGATGATA TTACCCAGGA AAACGTCATG
AGCTTTTTAA CGCCAGTCTA TCTGGCGGGA ACGTTAAAAG GTATCGTGAT GGTGGATGTT
AACCAGGATA ATTTAAAAAA TATTTTTTAT ACCCAGGACC GTCCGCTGGT TTGGCGTTAT
CTTAACGTAA CACTAAAGGA CATGGACTCC GGAAAAGAAA TTATTATTAA TCAAAGCAAA
AATAATCTGT TTCAATATGT GAATTATAGC CATGATATCC CGGGTGGACT GCGCGTTTCG
TTGTCCCTTG ATTTAACCTA TTTCCTTGTC TCGTCCTGGA AAGCGCTGGC CTTTTACTTA
CTGGCAACGG CGCTCCTGCT TAATATGGTA CGGATGCACT TTCGGCTTTA TCGCAACGTC
ACACGCGAAA ATATTAGCGA TGCCATGACC GGGCTTTACA ACCGTAAAAT ATTAACACCG
GAGCTGGAAC AGCGACTGCA ACGCCTGGTC AATGCCGGGA CGCCGGTGAC ATTTGTCGCT
ATTGATTGCG ACAGGTTAAA ACTGATCAAC GATACCCAGG GGCACCAGGA AGGCGACCGA
ATTATAACCC TGTTGGCGAA AGCGATTAAA ACATCGATTC GTAAAAGCGA TTATGCCATT
CGCCTCGGCG GCGATGAGTT CTGTATTATT CTTGTTGATT ATGCGGCGGA TTTGGCTATC
CATCTGCCGG AGCGTATTAT TCGTAACCTG CAAATTATCG CACCGGATAA GACAGTCCAT
TTTTCCGCCG GGATTTATAA TATGCAGCCC AATGATACGA TTAATGATGC CTACCAGGCT
TCCGATGCGC AGCTCTATCT GAACAAACAG CAAAAACAAC ATCGTTCATC ATAG
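
The sequence above is displayed in space-separated groups of ten bases, so the spaces and line breaks must be removed before any computation. The Python sketch below does this and computes a GC percentage of the kind reported in the GC content field. The helper names are hypothetical, and how the database rounds its figure is not stated, so the full-gene result may differ slightly from the listed 45%.

```python
def clean_sequence(text: str) -> str:
    """Drop the spaces and line breaks used for display; return uppercase bases."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# Demonstration on the first displayed line only; paste the whole gene
# sequence into the string to evaluate the full 1494 bp.
first_line = "ATGAATTTGC ATCATAAAGC GCTCAGGCAC TTTATCTCGG CAAGCGTCAT CGTTTTGACA"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```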
 
Protein sequence
MNLHHKALRH FISASVIVLT SSFLIYELIA SDRAMNAYMR YIMERADSSF LYDKYQNQSI 
AAHLMRTFEA PGDPVTAEKR RAFCDAFEAI NGTHGVNLTR HNYPGLHGTL QTAATQCTDN
LDDALLLPAF DQAVSINRSQ DDHSHGLGTL ELKFRYYVDL NKHYVYFYDL INSRRFAMHR
WTFLQKGTMG INRKDIDKLF TGRTVISSIY MDDITQENVM SFLTPVYLAG TLKGIVMVDV
NQDNLKNIFY TQDRPLVWRY LNVTLKDMDS GKEIIINQSK NNLFQYVNYS HDIPGGLRVS
LSLDLTYFLV SSWKALAFYL LATALLLNMV RMHFRLYRNV TRENISDAMT GLYNRKILTP
ELEQRLQRLV NAGTPVTFVA IDCDRLKLIN DTQGHQEGDR IITLLAKAIK TSIRKSDYAI
RLGGDEFCII LVDYAADLAI HLPERIIRNL QIIAPDKTVH FSAGIYNMQP NDTINDAYQA
SDAQLYLNKQ QKQHRSS
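
The protein sequence follows from the gene sequence under translation table 11. The Python sketch below is a minimal standard-library translator, not the tool the database used. Table 11 assigns the same amino acids as the standard genetic code and differs only in which codons may act as starts, so the standard codon assignments are used here. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons, enumerated in TCAG order with the third
# base varying fastest. "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence up to, but not including, the first stop."""
    seq = "".join(cds.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        # Codons with ambiguous bases are reported as X.
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        residues.append(aa)
    # Simplification: an alternative start codon such as GTG or TTG is not
    # converted to Met. This gene starts with ATG, so that case is not needed.
    return "".join(residues)


# First three displayed groups of the gene sequence (ten codons).
print(translate("ATGAATTTGC ATCATAAAGC GCTCAGGCAC"))  # prints: MNLHHKALRH
```

The output matches the first ten residues of the protein sequence, and the final bases of the gene (CATCGTTCAT CATAG) translate to HRSS followed by a stop, which matches its last four residues.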