Gene SeD_A1531 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A1531 
Symbol 
ID6874703 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp1479971 
End bp1481764 
Gene Length1794 bp 
Protein Length597 aa 
Translation table11 
GC content55% 
IMG OID642784683 
Producthydrogenase 1 large subunit 
Protein accessionYP_002215353 
Protein GI198245697 
COG category[C] Energy production and conversion 
COG ID[COG0374] Ni,Fe-hydrogenase I large subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.447937 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones56 
Fosmid unclonability p-value0.256595 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTAACC AGTATCAAAC CCAGGGTTAT ATCGTTAACG ACGCCGGCCG TCGTCTGATC 
GTGGACCCGA TTACCCGCAT CGAAGGGCAT ATGCGCTGCG AAGTGAACAT TGATGAGCAA
AACGTCATTA CCAATGCCGT TTCCTGCGGC ACAATGTTCC GCGGGCTGGA GATCATTCTG
CAAGGCCGTG ACCCGCGTGA CGCCTGGGCG TTTGTTGAAC GTATTTGTGG TGTATGTACC
GGGGTACATG CTCTGGCATC GGTGTACGCT ATCGAAGATG CGATTGGTAT CCAGGTGCCG
GATAACGCCA ATATTATACG TAACATCATG CTGGCTACGT TGTGGTGTCA CGATCATCTG
GTCCATTTTT ATCAGTTAGC CGGTATGGAC TGGATTGATG TTCTGAATGC GCTGAAAGCC
GATCCGCGCG CCACCTCGCA GTTGGCGCAA AGCCTTTCCG CCTGGCCGAT GTCATCGCCG
GGCTACTTCT TTGACGTACA AAATCGATTA AAGAAATTTG TCGATGGCGG CCAGCTTGGG
ATATTCCGTA ACGGCTATTG GGGACATCCA CAGTACAAAC TGTCGCCAGA GGCTAATCTG
ATGGGCTTTG CCCACTATCT TGAAGCCCTC GACTTCCAGC GCGAGATCGT CAAAATCCAT
ACCATATTTG GCGGTAAGAA CCCCCATCCT AACTGGATCG TCGGCGGAAT GCCGTGCGCG
ATCAATCTCG ATCAAAGCGG CGCGGTCGGG GCTATCAATA TGGAGCGCCT CAACCTGGTG
CAGTCGATCA TCACCCGCAC CGCCGACTTC ATTAACAACG TGATGGTTCC GGACGCGCTG
GCTATCGGCC AGTTCAATAA AGCATGGAGT CAAATTGGCA CCGGTCTGTC GGATAAATGC
GTGCTGAGCT ACGGCGCATT TCCGGACATT GCCAACGACT TCAGCCAGCA AAGCCTGTTA
ATGCCGGGCG GCGCGGTGGT CAATGGCGAT TTTAAAAATG TCATGCCGGT AGATTTGGCC
GATCCGCAGC AGATACAGGA GTTTGTGGAT CACGCCTGGT ACCGCTATCC GGACGATCGG
CTGGGGCGTC ATCCTTTTGA CGGCATTACC GATCCCTGGT ACAACCCGGG TGATGTCAAA
GGCAGCGATA CGCATATCCA GCAGCTCAAC GAGCAGGAAC GTTACTCCTG GATTAAAGCG
CCGCGCTGGC GGGGTCATGC CATGGAAGTC GGCCCGCTTG CCCGAACGCT TATCGCGTAC
CACAAAGGGG ATGCCGCCAC CATAGAGTCG GTAGATCGCA TGATGTCTGC CCTTAAACTG
CCGCTCTCCG GTATCCAGTC TACGCTTGGC CGTATTTTAT GCCGCGCGCA CGAAGCGCAA
TGGGCCGTCG GTAAGCTGCA GTATTTTTTC GACAGGCTTA TGACCAACCT GAAAAACGGC
GACCTGGCCA CCGCCAATAC CGAGAAATGG GAACCGGCCA GTTGGCCGCA GCACTGCCGC
GGCATCGGTT TTACCGAAGC GCCTCGTGGA GCGTTGGGAC ACTGGGCGTC AATACGCGAT
CAGAAGATTG AACTCTATCA GTGCGTGGTC CCCACCACCT GGAACGCCAG CCCGCGCGAT
CCTAAAAAAC AGATCGGCGC CTATGAAGCG GCATTGATGG GAACGCAAAT GGCGATTCCC
GACCAACCGT TAGAAATCCT GAGGACGCTG CACAGCTTCG ACCCCTGCCT CGCCTGTTCA
ACCCACGTGC TCGGCGACGA CGGCAGCGAA CTGATTGCCG TCCAGGTACG CTAA
 
Protein sequence
MSNQYQTQGY IVNDAGRRLI VDPITRIEGH MRCEVNIDEQ NVITNAVSCG TMFRGLEIIL 
QGRDPRDAWA FVERICGVCT GVHALASVYA IEDAIGIQVP DNANIIRNIM LATLWCHDHL
VHFYQLAGMD WIDVLNALKA DPRATSQLAQ SLSAWPMSSP GYFFDVQNRL KKFVDGGQLG
IFRNGYWGHP QYKLSPEANL MGFAHYLEAL DFQREIVKIH TIFGGKNPHP NWIVGGMPCA
INLDQSGAVG AINMERLNLV QSIITRTADF INNVMVPDAL AIGQFNKAWS QIGTGLSDKC
VLSYGAFPDI ANDFSQQSLL MPGGAVVNGD FKNVMPVDLA DPQQIQEFVD HAWYRYPDDR
LGRHPFDGIT DPWYNPGDVK GSDTHIQQLN EQERYSWIKA PRWRGHAMEV GPLARTLIAY
HKGDAATIES VDRMMSALKL PLSGIQSTLG RILCRAHEAQ WAVGKLQYFF DRLMTNLKNG
DLATANTEKW EPASWPQHCR GIGFTEAPRG ALGHWASIRD QKIELYQCVV PTTWNASPRD
PKKQIGAYEA ALMGTQMAIP DQPLEILRTL HSFDPCLACS THVLGDDGSE LIAVQVR