Gene SeD_A3151 details


Gene Information

Locus tag: SeD_A3151
Symbol: hypF
ID: 6875040
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853
Kingdom: Bacteria
Replicon accession: NC_011205
Strand:
Start bp: 3032127
End bp: 3034367
Gene Length: 2241 bp
Protein Length: 746 aa
Translation table: 11
GC content: 60%
IMG OID: 642786172
Product: [NiFe] hydrogenase maturation protein HypF
Protein accession: YP_002216813
Protein GI: 198244103
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0068] Hydrogenase maturation factor
TIGRFAM ID: [TIGR00143] [NiFe] hydrogenase maturation protein HypF
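
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids, assuming one terminal stop codon is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length(3032127, 3034367)
print(length, protein_length(length))
```

With the values in this record, the computed lengths agree with the listed 2241 bp and 746 aa.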


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 52
Fosmid unclonability p-value: 0.114477
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGATAG ACACACCTTC CGGCGTACAG CTACGCATTC GGGGCAAAGT ACAGGGCGTC 
GGTTTTCGTC CTTTTGTCTG GCAGCTGGCG CAGCAGTTGC AATTACACGG CGACGTGTGT
AATGACGGCG ACGGCGTCGT CGTTCGGCTG CTGGAAGATC CGTCGCAATT TATTGCCGCG
CTCTATCAGG ATTGCCCGCC GCTGGCGCGC ATTGACAGCG TTGAACACGC GTCGTTGGTA
TGGGAGCGCG CACCGAGGGA TTTCACCATT CGTCAGAGCG CAGGCGGTTC GATGAACACG
CAAATCGTGC CGGATGCGGC GACCTGCCCG GCATGTCTTG CCGAGATGAA TACCCCAGGC
GAGCGGCGCT ACCGTTATCC TTTCATCAAT TGCACCCACT GCGGACCACG CTTCACCATT
ATTCGCGCTA TGCCCTATGA CCGGCCATTT ACGGTGATGG CGGCGTTTCC CTTGTGTCCG
GAATGCGACA GCGAATACCG GGATCCGTAC GATCGCCGTT TTCATGCCCA GCCCGTTGCC
TGTCCGTCAT GCGGGCCGCA TCTTGAGTGG CGGAGCCAAC ATGAACGAGC GGAAAAAGAG
GCGGCTTTGC AGGCGGCGGT CGCCCAACTG AACGCCGGAG GCATTATTGC CGTTAAAGGG
CTGGGTGGTT TTCATCTGGC CTGCGACGCG CGCAACGATA ACGCAGTGGC GATGCTGCGG
GCGCGTAAAC ATCGTCCGGC GAAACCATTG GCGGTGATGT TGCCCACGGC GCAAACGCTG
CCGACCGCGG CGCGTTCGCT GCTGACCACG CCAGCGGCCC CGATTGTGCT GGTGGATAAG
CAGTATATAC CTTCGCTGAG TGAGGGCATC GCGCCAGGAC TTACGGAGGT GGGTGTGATG
CTGCCAGCCA ACCCATTGCA ACACCTCTTG TTGCAGGAGC TCAATTACCC GCTGGTGATG
ACATCCGGCA ACCTGAGCGG CAGACCGCCC GCCATCACCA ACGAACAGGC GCTGGACGAT
TTACACGATA TTGCCGATGG TTTTCTGTTG CACAATCGCG ACATTGTACA GCGCATGGAC
GACTCTGTCG TGCGCGACAG CGGCGAAATG CTGCGTCGTT CGCGGGGATA CGTGCCGGAC
GCGATTGCGT TGCCGCCCGG ATTTCGCGAT GTGCCGCCGA TACTTTGTCT GGGCGCGGAT
CTGAAAAACA CGTTCTGTCT GGTACGCGGC GAACAGGCGG TTGTCAGCCA GCATTTAGGC
GATCTCAGCG ATGACGGTAT CCAGGCGCAG TGGCGCGAGG CATTGCGTCT GATCCAGTCA
ATCTACGATT TTACCCCAGA GCGTATCGTC TGTGATGCGC ATCCGGGCTA TGTTTCCAGT
CAGTGGGCCA GTGAGATGCG TCTGCCGACA GAGACGGTGT TACACCATCA TGCCCATGCG
GCGGCTTGCC TGGCCGAGCA TGGTTGGCCG CTGGACGGCG GAGAGGTGAT TGCCCTGACG
GTAGACGGTA TCGGCATGGG TGAGAATGGC GCGCTATGGG GCGGAGAATG TTTGCGGGTC
AATTATCGCG AATGCGAACA TTTAGGCGGT TTACCCGCCG TGGCGCTGCC GGGAGGCGAT
CTGGCTGCCA AACATCCGTG GCGTAATCTG TTGGCGCAGT GCCTGCGCTT TGTGCCGGAC
TGGCAGGATT ATCCGGAGAC AGCGGGGCTG CAACAGCAAA ACTGGAATGT CCTGGCGCGC
GCCATTGAGC GCGGCGTCAA TGCCCCATTG GCGTCTTCCT GCGGGCGGTT GTTTGACGCG
GTGGCTGCCG CGCTTCGCTG CGCGCCAGCA TCGCTTAGCT ATGAGGGCGA GGCCGCCTGC
GCGCTGGAGG CGCTGGCCTC TCAATGCGCT AACGTTGAGC ATCCGGTAAC GATGCCGCTT
AACGGCGCTC AACTGGACGT CGCTGTTTTC TGGCGGCAAT GGTTGAACTG GCAGGCCACG
CCCGCGCAAC GCGCCTGGGC TTTTCATGAT GCGCTGGCGT GCGGGTTTGC CACGCTAATG
CGCCAGCAGG CTACGGCGCG GGGGATAACC ACTCTGGTCT TCAGCGGCGG GGTGATACAC
AACCGCTTAC TTCGCGCGCG TCTTGCCTTT TATCTTTCTG ATTTTAAATT GTTATTTCCG
CAGCGGTTAC CAGCGGGCGA CGGCGGGCTG TCGTTTGGAC AGGGCGTGAT TGCCGCAGCG
CGAGCGTTAA GTGAAGTGTA G
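
The gene sequence is displayed in space-separated blocks of 10 bases, so it has to be joined before any calculation. The sketch below shows one way to strip the layout and compute the GC fraction behind the "GC content" field; the helper names `clean_sequence` and `gc_fraction` are illustrative assumptions, not part of any existing tool.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence above, used as a short example.
first_line = "ATGGCGATAG ACACACCTTC CGGCGTACAG CTACGCATTC GGGGCAAAGT ACAGGGCGTC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```

Passing the full sequence instead of the first line gives the value for the whole gene.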
 
Protein sequence
MAIDTPSGVQ LRIRGKVQGV GFRPFVWQLA QQLQLHGDVC NDGDGVVVRL LEDPSQFIAA 
LYQDCPPLAR IDSVEHASLV WERAPRDFTI RQSAGGSMNT QIVPDAATCP ACLAEMNTPG
ERRYRYPFIN CTHCGPRFTI IRAMPYDRPF TVMAAFPLCP ECDSEYRDPY DRRFHAQPVA
CPSCGPHLEW RSQHERAEKE AALQAAVAQL NAGGIIAVKG LGGFHLACDA RNDNAVAMLR
ARKHRPAKPL AVMLPTAQTL PTAARSLLTT PAAPIVLVDK QYIPSLSEGI APGLTEVGVM
LPANPLQHLL LQELNYPLVM TSGNLSGRPP AITNEQALDD LHDIADGFLL HNRDIVQRMD
DSVVRDSGEM LRRSRGYVPD AIALPPGFRD VPPILCLGAD LKNTFCLVRG EQAVVSQHLG
DLSDDGIQAQ WREALRLIQS IYDFTPERIV CDAHPGYVSS QWASEMRLPT ETVLHHHAHA
AACLAEHGWP LDGGEVIALT VDGIGMGENG ALWGGECLRV NYRECEHLGG LPAVALPGGD
LAAKHPWRNL LAQCLRFVPD WQDYPETAGL QQQNWNVLAR AIERGVNAPL ASSCGRLFDA
VAAALRCAPA SLSYEGEAAC ALEALASQCA NVEHPVTMPL NGAQLDVAVF WRQWLNWQAT
PAQRAWAFHD ALACGFATLM RQQATARGIT TLVFSGGVIH NRLLRARLAF YLSDFKLLFP
QRLPAGDGGL SFGQGVIAAA RALSEV
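
The protein sequence can be reproduced from the gene sequence by codon translation. The sketch below builds the codon table from the amino acid string of NCBI translation table 11, whose codon assignments match the standard code, and stops at the first stop codon. It does not handle the alternative start codons that table 11 allows, which is sufficient here only because this gene starts with ATG. The `translate` function is a hypothetical helper, not a library call.

```python
BASES = "TCAG"
# Amino acids for codons in TCAG order (NCBI translation table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# Start and end of the gene sequence above.
print(translate("ATGGCGATAG ACACACCTTC C"))
print(translate("CGAGCGTTAA GTGAAGTGTA G"))
```

The two fragments correspond to the first and last residues of the protein sequence shown above.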