Gene SeD_A3514 details


Gene Information

Locus tag: SeD_A3514
Symbol:
ID: 6875202
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853
Kingdom: Bacteria
Replicon accession: NC_011205
Strand:
Start bp: 3371188
End bp: 3372351
Gene Length: 1164 bp
Protein Length: 387 aa
Translation table: 11
GC content: 55%
IMG OID: 642786505
Product: alcohol dehydrogenase YqhD
Protein accession: YP_002217142
Protein GI: 198243581
COG category: [C] Energy production and conversion
COG ID: [COG1979] Uncharacterized oxidoreductases, Fe-dependent alcohol dehydrogenase family
TIGRFAM ID:
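
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (which matches the listed gene length) and that the final codon of the CDS is a stop codon. The function names are illustrative and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(3371188, 3372351)
print(length, protein_length_aa(length))  # prints: 1164 387
```

Both values agree with the Gene Length and Protein Length fields of this record.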


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 72
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACAATT TCAAATTACA TACCCCTACC CGCATTCTGT TTGGTAAAGG CGCTATCGTC 
GATCTTCGCG ATCAAATCCC GCAGGACGCC CGCGTGCTGA TTACCTATGG CGGCGGTAGC
GTGAAAAAGA CCGGCGTACT GGCGCAAGTT CAGGAAGCGT TAAAAGGACT GGATGTTCGG
GAGTTTGGCG GTATTGAACC GAACCCGTCC TATGAAACGC TGATGAAGGC CGTGCAGCTT
GTGCGCGACG AAAATATTAC CTTCCTGCTG GCCGTTGGCG GCGGTTCCGT ACTGGACGGC
ACCAAATTTA TCGCCGCAGC GGCGCAGTAT ACGGACGGCG TCGATCCGTG GCACATTCTG
GAAACCGGCG GTACTGAGAT TCGTAGCGCA ATCCCGATGG GGTCTGTGCT GACCCTACCG
GCAACCGGCT CAGAATCGAA CGCCGGCGCA GTCATTTCCC GTAAAACCAC TGGCGACAAA
CAGGCCTTCC ACTCCTCGTT CGTGCAACCG GTGTTTGCCG TACTGGACCC GGTCTACACC
TATACATTGC CGCCGCGCCA GGTTGCGAAC GGCGTCGTGG ATGCTTTCGT CCATACCGTT
GAGCAGTACG TCACTTACCC GGTTAACGGC AAAATTCAGG ATCGTTTCGC CGAAGGTATT
TTACTCACAC TGATCGAAGA AGGTCCCAAA GCGCTGCAAG AGCCTGAAAA TTATGACGTC
CGCGCTAACG TGATGTGGGC CGCTACTCAG GCGTTGAATG GCCTGATTGG CGCTGGTGTT
CCGCAGGACT GGGCGACACA TATGCTGGGC CATGAGCTTA CCGCGATGCA CGGTCTCGAT
CATGCCCAGA CGCTGGCTAT CATTCTGCCT GCGCTATGGA ACGAAAAACG AGACGTTAAA
CGTGCAAAAC TCCTGCAATA TGCTGAACGT GTATGGAATA TCACCGACGG TTCCGACGAC
GAGCGTATTG ATGCCGCCAT TGCCGCCACC CGCCGTTTCT TTGAACAGAT GGGCGTGCCT
ACCCGTCTGT CCGATTACGG TCTGGATGGC AGCACTATTC CGGCGCTACT GGCTAAACTT
GAGGCGCATG GATGCAAAAA TTTAGGTGAA AATCAGGATA TTACGCTGGA TGTCAGCCGT
CGGATTTATG AAGCGGCGCG CTAA
 
Protein sequence
MNNFKLHTPT RILFGKGAIV DLRDQIPQDA RVLITYGGGS VKKTGVLAQV QEALKGLDVR 
EFGGIEPNPS YETLMKAVQL VRDENITFLL AVGGGSVLDG TKFIAAAAQY TDGVDPWHIL
ETGGTEIRSA IPMGSVLTLP ATGSESNAGA VISRKTTGDK QAFHSSFVQP VFAVLDPVYT
YTLPPRQVAN GVVDAFVHTV EQYVTYPVNG KIQDRFAEGI LLTLIEEGPK ALQEPENYDV
RANVMWAATQ ALNGLIGAGV PQDWATHMLG HELTAMHGLD HAQTLAIILP ALWNEKRDVK
RAKLLQYAER VWNITDGSDD ERIDAAIAAT RRFFEQMGVP TRLSDYGLDG STIPALLAKL
EAHGCKNLGE NQDITLDVSR RIYEAAR
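
The sequences above are printed in blocks of ten characters, so the spacing has to be removed before any computation. The following is a minimal sketch of that clean-up together with a GC-fraction calculation of the kind that underlies the GC content field. The helper names are hypothetical, and the example uses only the first line of the gene sequence, so its result is not the whole-gene figure.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C, ignoring layout whitespace."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence listed in this record.
first_line = "ATGAACAATT TCAAATTACA TACCCCTACC CGCATTCTGT TTGGTAAAGG CGCTATCGTC"
print(len(clean_sequence(first_line)))  # prints: 60
print(f"{gc_fraction(first_line):.1%}")
```

Passing the full gene sequence (all lines joined) to `gc_fraction` gives a value that can be compared with the rounded percentage in the record.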