Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SNSL254_A3044 |
Symbol | hypF |
ID | 6483827 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | - |
Start bp | 2964238 |
End bp | 2966478 |
Gene Length | 2241 bp |
Protein Length | 746 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 642738359 |
Product | [NiFe] hydrogenase maturation protein HypF |
Protein accession | YP_002042083 |
Protein GI | 194444479 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0068] Hydrogenase maturation factor |
TIGRFAM ID | [TIGR00143] [NiFe] hydrogenase maturation protein HypF |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 50 |
Fosmid unclonability p-value | 0.0442522 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGATAG ACACACCTTC CGGCGTACAG CTACGCATTC GGGGCAAAGT ACAGGGCGTC GGTTTTCGTC CTTTTGTCTG GCAGCTGGCG CAGCAGTTGC GATTACACGG CGACGTGTGT AATGACGGCG ACGGCGTCGT CGTTCGGCTG CTGGAAGAGC CGTCGCAATT TATTGCCGCG CTCTATCAGG ATTGCCCGCC GCTGGCGCGC ATTGACAGCG TTGAACACGC GTCGTTGGTA TGGGAGCGCG CACCGACGGA TTTCACCATT CGTCAGAGCG CAGGCGGTTC GATGAACACG CAAATCGTGC CGGATGCGGC GACCTGCCCG GCATGTCTTG CCGAGATGAA TACCCCAGGC GAGCGGCGCT ACCGTTATCC TTTCATCAAT TGCACCCACT GTGGACCACG CTTCACCATT ATTCGTGCTA TGCCCTATGA CCGGCCATTT ACGGTGATGG CGGCGTTTCC CTTGTGTCCG GAATGCGACA GCGAATACCG CGATCCGTAC GATCGCCGTT TTCATGCCCA GCCCGTTGCC TGTCCGTCAT GCGGGCCGCA TCTTGAGTGG CGGAGCCAAC ATGAACGAGC GGAAAAAGAG GCGGCTTTGC AGGCGGCGGT CGCCCAACTG AACGCCGGAG GCATTATTGC CGTTAAAGGG CTGGGTGGTT TTCATCTGGC CTGCGACGCG CGCAACGATA ACGCAGTGGC GATGCTGCGG GCGCGTAAAC ATCGTCCGGC GAAACCATTG GCGGTGATGT TGCCCACGGC GCAAACGCTG CCGAGCGCGG CGCGTTCGCT GCTGACCACG CCAGCGGCCC CGATTGTACT GGTGGATAAG CAGTATGTAC CTTCGCTGAG TGAGGGCATC GCGCCAGGAC TTACGGAAGT GGGCGTGATG CTGCCAGCCA ACCCATTGCA ACATCTCTTG TTGCAGGCGC TCAATTACCC GCTGGTGATG ACATCCGGCA ACCTGAGCGG CAAACCGCCC GCCATCACCA ACGAACAGGC GCTGGACGAT TTACACGATA TTGCCGATGG TTTTCTGTTG CACAATCGCG ACATTGTACA GCGCATGGAC GACTCTGTCG TGCGCGACAG CGGCGAAATG CTGCGTCGTT CGCGGGGATA CGTGCCGGAC GCGATTGCGT TGCCGCCCGG ATTTCGCGAT GTGCCGCCGA TACTTTGTCT GGGCGCGGAT CTGAAAAACA CGTTCTGTCT GGTACGCGGC GAACAGGCAG TTGTCAGCCA GCATTTAGGC GATCTCAGCG ATGACGGTAT CCAGGCGCAG TGGCGCGAGG CATTGCGTTT GATCCAGTCA ATCTACGATT TTACCCCAGA GCGTATCGTC TGTGATGCGC ATCCGGGCTA TGTTTCCAGT CAGTGGGCCA GTGAGATGCG TCTACCGACA GAGACGGTGT TACACCATCA TGCCCATGCG GCGGCTTGCC TGGCCGAGCA TGGGTGGCCG CTGGACGGCG GAGAGGTGAT TGCCCTGACG GTAGACGGTA TCGGCATGGG TGAGAATGGC GCGCTATGGG GCGGAGAATG TTTGCGGGTC AATTATCGCG AATGCGAGCA TTTAGGCGGT TTACCCGCCG TGGCGCTGCC GGGAGGCGAT CTGGCTGCCA AACAGCCGTG GCGTAATCTG TTAGCGCAGT GCCTGCGCTT TGTGCCGGAC TGGCAGGATT ACCCGGAGAC AGCGGGGCTG CAACAGCAAA ACTGGAGTGT CCTGGCGCGC GCCATTGAGC GCGGCGTCAA TTCCCCGTTG GCCTCTTCCT GCGGGCGGTT GTTTGACGCG GTGGCTGCCG CGCTTCGCTG CGCGCCAGCA TCGCTTAGCT ATGAGGGCGA GGCCGCCTGC GCGCTGGAGG CGCTGGCCTC TCAATGCGCT AACGTTGAGC ATCCGGTAAC GATGCCGCTT AACGGCGCTC AACTGGACGT CGCTGTTTTC TGGCGTCAAT GGTTGAACTG GCAGGCCACG CCCGCGCAAC GCGCCTGGGC TTTTCATGAT GCGCTGGCGT GCGGGTTTGC CACGCTAATG CGCCAGCAGG CTACGGCGCG GGGGATAACC ACTCTGGTCT TCAGCGGCGG GGTAATACAC AACCGCTTAC TTCGCGCGCG TCTTGCCTTT TATCTTTCTG ATTTTAAATT GTTATTTCCG CAGCGGTTAC CGGCGGGCGA CGGCGGGCTG TCGTTTGGAC AGGGCGTGAT TGCCGCAGCG CGAGCGTTAA GTGAAGTGTA G
|
Protein sequence | MAIDTPSGVQ LRIRGKVQGV GFRPFVWQLA QQLRLHGDVC NDGDGVVVRL LEEPSQFIAA LYQDCPPLAR IDSVEHASLV WERAPTDFTI RQSAGGSMNT QIVPDAATCP ACLAEMNTPG ERRYRYPFIN CTHCGPRFTI IRAMPYDRPF TVMAAFPLCP ECDSEYRDPY DRRFHAQPVA CPSCGPHLEW RSQHERAEKE AALQAAVAQL NAGGIIAVKG LGGFHLACDA RNDNAVAMLR ARKHRPAKPL AVMLPTAQTL PSAARSLLTT PAAPIVLVDK QYVPSLSEGI APGLTEVGVM LPANPLQHLL LQALNYPLVM TSGNLSGKPP AITNEQALDD LHDIADGFLL HNRDIVQRMD DSVVRDSGEM LRRSRGYVPD AIALPPGFRD VPPILCLGAD LKNTFCLVRG EQAVVSQHLG DLSDDGIQAQ WREALRLIQS IYDFTPERIV CDAHPGYVSS QWASEMRLPT ETVLHHHAHA AACLAEHGWP LDGGEVIALT VDGIGMGENG ALWGGECLRV NYRECEHLGG LPAVALPGGD LAAKQPWRNL LAQCLRFVPD WQDYPETAGL QQQNWSVLAR AIERGVNSPL ASSCGRLFDA VAAALRCAPA SLSYEGEAAC ALEALASQCA NVEHPVTMPL NGAQLDVAVF WRQWLNWQAT PAQRAWAFHD ALACGFATLM RQQATARGIT TLVFSGGVIH NRLLRARLAF YLSDFKLLFP QRLPAGDGGL SFGQGVIAAA RALSEV
|
| |