Gene SNSL254_A3044 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A3044 
SymbolhypF 
ID6483827 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp2964238 
End bp2966478 
Gene Length2241 bp 
Protein Length746 aa 
Translation table11 
GC content60% 
IMG OID642738359 
Product[NiFe] hydrogenase maturation protein HypF 
Protein accessionYP_002042083 
Protein GI194444479 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0068] Hydrogenase maturation factor 
TIGRFAM ID[TIGR00143] [NiFe] hydrogenase maturation protein HypF 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones50 
Fosmid unclonability p-value0.0442522 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGATAG ACACACCTTC CGGCGTACAG CTACGCATTC GGGGCAAAGT ACAGGGCGTC 
GGTTTTCGTC CTTTTGTCTG GCAGCTGGCG CAGCAGTTGC GATTACACGG CGACGTGTGT
AATGACGGCG ACGGCGTCGT CGTTCGGCTG CTGGAAGAGC CGTCGCAATT TATTGCCGCG
CTCTATCAGG ATTGCCCGCC GCTGGCGCGC ATTGACAGCG TTGAACACGC GTCGTTGGTA
TGGGAGCGCG CACCGACGGA TTTCACCATT CGTCAGAGCG CAGGCGGTTC GATGAACACG
CAAATCGTGC CGGATGCGGC GACCTGCCCG GCATGTCTTG CCGAGATGAA TACCCCAGGC
GAGCGGCGCT ACCGTTATCC TTTCATCAAT TGCACCCACT GTGGACCACG CTTCACCATT
ATTCGTGCTA TGCCCTATGA CCGGCCATTT ACGGTGATGG CGGCGTTTCC CTTGTGTCCG
GAATGCGACA GCGAATACCG CGATCCGTAC GATCGCCGTT TTCATGCCCA GCCCGTTGCC
TGTCCGTCAT GCGGGCCGCA TCTTGAGTGG CGGAGCCAAC ATGAACGAGC GGAAAAAGAG
GCGGCTTTGC AGGCGGCGGT CGCCCAACTG AACGCCGGAG GCATTATTGC CGTTAAAGGG
CTGGGTGGTT TTCATCTGGC CTGCGACGCG CGCAACGATA ACGCAGTGGC GATGCTGCGG
GCGCGTAAAC ATCGTCCGGC GAAACCATTG GCGGTGATGT TGCCCACGGC GCAAACGCTG
CCGAGCGCGG CGCGTTCGCT GCTGACCACG CCAGCGGCCC CGATTGTACT GGTGGATAAG
CAGTATGTAC CTTCGCTGAG TGAGGGCATC GCGCCAGGAC TTACGGAAGT GGGCGTGATG
CTGCCAGCCA ACCCATTGCA ACATCTCTTG TTGCAGGCGC TCAATTACCC GCTGGTGATG
ACATCCGGCA ACCTGAGCGG CAAACCGCCC GCCATCACCA ACGAACAGGC GCTGGACGAT
TTACACGATA TTGCCGATGG TTTTCTGTTG CACAATCGCG ACATTGTACA GCGCATGGAC
GACTCTGTCG TGCGCGACAG CGGCGAAATG CTGCGTCGTT CGCGGGGATA CGTGCCGGAC
GCGATTGCGT TGCCGCCCGG ATTTCGCGAT GTGCCGCCGA TACTTTGTCT GGGCGCGGAT
CTGAAAAACA CGTTCTGTCT GGTACGCGGC GAACAGGCAG TTGTCAGCCA GCATTTAGGC
GATCTCAGCG ATGACGGTAT CCAGGCGCAG TGGCGCGAGG CATTGCGTTT GATCCAGTCA
ATCTACGATT TTACCCCAGA GCGTATCGTC TGTGATGCGC ATCCGGGCTA TGTTTCCAGT
CAGTGGGCCA GTGAGATGCG TCTACCGACA GAGACGGTGT TACACCATCA TGCCCATGCG
GCGGCTTGCC TGGCCGAGCA TGGGTGGCCG CTGGACGGCG GAGAGGTGAT TGCCCTGACG
GTAGACGGTA TCGGCATGGG TGAGAATGGC GCGCTATGGG GCGGAGAATG TTTGCGGGTC
AATTATCGCG AATGCGAGCA TTTAGGCGGT TTACCCGCCG TGGCGCTGCC GGGAGGCGAT
CTGGCTGCCA AACAGCCGTG GCGTAATCTG TTAGCGCAGT GCCTGCGCTT TGTGCCGGAC
TGGCAGGATT ACCCGGAGAC AGCGGGGCTG CAACAGCAAA ACTGGAGTGT CCTGGCGCGC
GCCATTGAGC GCGGCGTCAA TTCCCCGTTG GCCTCTTCCT GCGGGCGGTT GTTTGACGCG
GTGGCTGCCG CGCTTCGCTG CGCGCCAGCA TCGCTTAGCT ATGAGGGCGA GGCCGCCTGC
GCGCTGGAGG CGCTGGCCTC TCAATGCGCT AACGTTGAGC ATCCGGTAAC GATGCCGCTT
AACGGCGCTC AACTGGACGT CGCTGTTTTC TGGCGTCAAT GGTTGAACTG GCAGGCCACG
CCCGCGCAAC GCGCCTGGGC TTTTCATGAT GCGCTGGCGT GCGGGTTTGC CACGCTAATG
CGCCAGCAGG CTACGGCGCG GGGGATAACC ACTCTGGTCT TCAGCGGCGG GGTAATACAC
AACCGCTTAC TTCGCGCGCG TCTTGCCTTT TATCTTTCTG ATTTTAAATT GTTATTTCCG
CAGCGGTTAC CGGCGGGCGA CGGCGGGCTG TCGTTTGGAC AGGGCGTGAT TGCCGCAGCG
CGAGCGTTAA GTGAAGTGTA G
 
Protein sequence
MAIDTPSGVQ LRIRGKVQGV GFRPFVWQLA QQLRLHGDVC NDGDGVVVRL LEEPSQFIAA 
LYQDCPPLAR IDSVEHASLV WERAPTDFTI RQSAGGSMNT QIVPDAATCP ACLAEMNTPG
ERRYRYPFIN CTHCGPRFTI IRAMPYDRPF TVMAAFPLCP ECDSEYRDPY DRRFHAQPVA
CPSCGPHLEW RSQHERAEKE AALQAAVAQL NAGGIIAVKG LGGFHLACDA RNDNAVAMLR
ARKHRPAKPL AVMLPTAQTL PSAARSLLTT PAAPIVLVDK QYVPSLSEGI APGLTEVGVM
LPANPLQHLL LQALNYPLVM TSGNLSGKPP AITNEQALDD LHDIADGFLL HNRDIVQRMD
DSVVRDSGEM LRRSRGYVPD AIALPPGFRD VPPILCLGAD LKNTFCLVRG EQAVVSQHLG
DLSDDGIQAQ WREALRLIQS IYDFTPERIV CDAHPGYVSS QWASEMRLPT ETVLHHHAHA
AACLAEHGWP LDGGEVIALT VDGIGMGENG ALWGGECLRV NYRECEHLGG LPAVALPGGD
LAAKQPWRNL LAQCLRFVPD WQDYPETAGL QQQNWSVLAR AIERGVNSPL ASSCGRLFDA
VAAALRCAPA SLSYEGEAAC ALEALASQCA NVEHPVTMPL NGAQLDVAVF WRQWLNWQAT
PAQRAWAFHD ALACGFATLM RQQATARGIT TLVFSGGVIH NRLLRARLAF YLSDFKLLFP
QRLPAGDGGL SFGQGVIAAA RALSEV