Gene SeD_A1803 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A1803 
Symbol 
ID6873086 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp1749292 
End bp1750353 
Gene Length1062 bp 
Protein Length353 aa 
Translation table11 
GC content50% 
IMG OID642784938 
Producthydrogenase-1 operon protein HyaF2 
Protein accessionYP_002215606 
Protein GI198245218 
COG category[C] Energy production and conversion 
COG ID[COG1773] Rubredoxin 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.080124 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones97 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAATTATT CCAGAACTAT TCCAGTCGTT AATATTGCCG GACCAGGCTC GCAGCCAGAA 
GAGGAAGACT TTAACTTTCT TCCTATCCCC GCCGGCATTA ATCTCCCGCT GACGCCGGTT
TTACCGGAAC AGGCGCTGCC CGCTGAGCTC CGTGTTGCCA GACACATCCT GACTACGCTT
ATTCGCGATA TGGATAACCC AGTGGCAACG CTCCCCTTTC CCCTGAGCTA TAAGCTGAAT
GCCACTGAGC AACAGAATAG CGGTTTATTG GATCAACTGC TCGGCGAAGG TGAAATCTCC
GCCCGGGTAC TATTATCCGA TGGAAAAGAA CAGCGTATTC AGGAGACGGT TTTTACGGGC
GTCTGGCGTG TGCGTGAATA TAACGCTGAC CAGCAACGGG TTGCCGATGA AATTATCATT
GGCCCGATCC CAGAGAGCAT CTGGCAGACG CATCCGCAGC CGCCGATTAC GCCAGAATTG
CCGCCACAAC CGGCGGGATT GATGAATGGT GCCTTTATCG CGCATGAAAT AGCCGAGCGC
GTAAAACAGC CGGTAAAAGA GCCGCATATC ATTAACTTAA CGCTGTTGCC AGTAAACGAT
GCCGATCGCG AGTATCTGGA TCATTTTTTA GGCGAAGGTT GTAGCGCTAT TTTTTCACGC
GGATATGGTA AATGCCGGAT TGTAAGCACG CATTTTCCCG GCGTATGGCG GGTCAATTAT
TTCAATGATA TGAACACATT ACTGCAAGAT ATGATTGAGA TAGCGGACAT TCCTGATATC
GCCGTTGCAG GCATCGATGA TATCGAAGAT GCCTACGCGG GGCTAAAAAA TACGTTGGAA
TGGTTGAAAG AATACCCGGT TACAGAAAAT GAGCCAGTGG TGCGCATGGA GTGCAAAGTA
TGTTGGTGGG TTTACGACCC TGCGCTGGGC GATGACGTAT GGCAAATTCC ACCCGGTGTG
CCCTTCAGCC AGTTACCTGA TTACTGGTGC TGTCCGGTTT GCGAAACCAG TAAGTCCGGG
TTTATGGTGA TCGATGAAGG TAATAGTTCG TGCAAAGATT GA
 
Protein sequence
MNYSRTIPVV NIAGPGSQPE EEDFNFLPIP AGINLPLTPV LPEQALPAEL RVARHILTTL 
IRDMDNPVAT LPFPLSYKLN ATEQQNSGLL DQLLGEGEIS ARVLLSDGKE QRIQETVFTG
VWRVREYNAD QQRVADEIII GPIPESIWQT HPQPPITPEL PPQPAGLMNG AFIAHEIAER
VKQPVKEPHI INLTLLPVND ADREYLDHFL GEGCSAIFSR GYGKCRIVST HFPGVWRVNY
FNDMNTLLQD MIEIADIPDI AVAGIDDIED AYAGLKNTLE WLKEYPVTEN EPVVRMECKV
CWWVYDPALG DDVWQIPPGV PFSQLPDYWC CPVCETSKSG FMVIDEGNSS CKD