Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeD_A3151 |
Symbol | hypF |
ID | 6875040 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853 |
Kingdom | Bacteria |
Replicon accession | NC_011205 |
Strand | - |
Start bp | 3032127 |
End bp | 3034367 |
Gene Length | 2241 bp |
Protein Length | 746 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 642786172 |
Product | [NiFe] hydrogenase maturation protein HypF |
Protein accession | YP_002216813 |
Protein GI | 198244103 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0068] Hydrogenase maturation factor |
TIGRFAM ID | [TIGR00143] [NiFe] hydrogenase maturation protein HypF |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 52 |
Fosmid unclonability p-value | 0.114477 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGATAG ACACACCTTC CGGCGTACAG CTACGCATTC GGGGCAAAGT ACAGGGCGTC GGTTTTCGTC CTTTTGTCTG GCAGCTGGCG CAGCAGTTGC AATTACACGG CGACGTGTGT AATGACGGCG ACGGCGTCGT CGTTCGGCTG CTGGAAGATC CGTCGCAATT TATTGCCGCG CTCTATCAGG ATTGCCCGCC GCTGGCGCGC ATTGACAGCG TTGAACACGC GTCGTTGGTA TGGGAGCGCG CACCGAGGGA TTTCACCATT CGTCAGAGCG CAGGCGGTTC GATGAACACG CAAATCGTGC CGGATGCGGC GACCTGCCCG GCATGTCTTG CCGAGATGAA TACCCCAGGC GAGCGGCGCT ACCGTTATCC TTTCATCAAT TGCACCCACT GCGGACCACG CTTCACCATT ATTCGCGCTA TGCCCTATGA CCGGCCATTT ACGGTGATGG CGGCGTTTCC CTTGTGTCCG GAATGCGACA GCGAATACCG GGATCCGTAC GATCGCCGTT TTCATGCCCA GCCCGTTGCC TGTCCGTCAT GCGGGCCGCA TCTTGAGTGG CGGAGCCAAC ATGAACGAGC GGAAAAAGAG GCGGCTTTGC AGGCGGCGGT CGCCCAACTG AACGCCGGAG GCATTATTGC CGTTAAAGGG CTGGGTGGTT TTCATCTGGC CTGCGACGCG CGCAACGATA ACGCAGTGGC GATGCTGCGG GCGCGTAAAC ATCGTCCGGC GAAACCATTG GCGGTGATGT TGCCCACGGC GCAAACGCTG CCGACCGCGG CGCGTTCGCT GCTGACCACG CCAGCGGCCC CGATTGTGCT GGTGGATAAG CAGTATATAC CTTCGCTGAG TGAGGGCATC GCGCCAGGAC TTACGGAGGT GGGTGTGATG CTGCCAGCCA ACCCATTGCA ACACCTCTTG TTGCAGGAGC TCAATTACCC GCTGGTGATG ACATCCGGCA ACCTGAGCGG CAGACCGCCC GCCATCACCA ACGAACAGGC GCTGGACGAT TTACACGATA TTGCCGATGG TTTTCTGTTG CACAATCGCG ACATTGTACA GCGCATGGAC GACTCTGTCG TGCGCGACAG CGGCGAAATG CTGCGTCGTT CGCGGGGATA CGTGCCGGAC GCGATTGCGT TGCCGCCCGG ATTTCGCGAT GTGCCGCCGA TACTTTGTCT GGGCGCGGAT CTGAAAAACA CGTTCTGTCT GGTACGCGGC GAACAGGCGG TTGTCAGCCA GCATTTAGGC GATCTCAGCG ATGACGGTAT CCAGGCGCAG TGGCGCGAGG CATTGCGTCT GATCCAGTCA ATCTACGATT TTACCCCAGA GCGTATCGTC TGTGATGCGC ATCCGGGCTA TGTTTCCAGT CAGTGGGCCA GTGAGATGCG TCTGCCGACA GAGACGGTGT TACACCATCA TGCCCATGCG GCGGCTTGCC TGGCCGAGCA TGGTTGGCCG CTGGACGGCG GAGAGGTGAT TGCCCTGACG GTAGACGGTA TCGGCATGGG TGAGAATGGC GCGCTATGGG GCGGAGAATG TTTGCGGGTC AATTATCGCG AATGCGAACA TTTAGGCGGT TTACCCGCCG TGGCGCTGCC GGGAGGCGAT CTGGCTGCCA AACATCCGTG GCGTAATCTG TTGGCGCAGT GCCTGCGCTT TGTGCCGGAC TGGCAGGATT ATCCGGAGAC AGCGGGGCTG CAACAGCAAA ACTGGAATGT CCTGGCGCGC GCCATTGAGC GCGGCGTCAA TGCCCCATTG GCGTCTTCCT GCGGGCGGTT GTTTGACGCG GTGGCTGCCG CGCTTCGCTG CGCGCCAGCA TCGCTTAGCT ATGAGGGCGA GGCCGCCTGC GCGCTGGAGG CGCTGGCCTC TCAATGCGCT AACGTTGAGC ATCCGGTAAC GATGCCGCTT AACGGCGCTC AACTGGACGT CGCTGTTTTC TGGCGGCAAT GGTTGAACTG GCAGGCCACG CCCGCGCAAC GCGCCTGGGC TTTTCATGAT GCGCTGGCGT GCGGGTTTGC CACGCTAATG CGCCAGCAGG CTACGGCGCG GGGGATAACC ACTCTGGTCT TCAGCGGCGG GGTGATACAC AACCGCTTAC TTCGCGCGCG TCTTGCCTTT TATCTTTCTG ATTTTAAATT GTTATTTCCG CAGCGGTTAC CAGCGGGCGA CGGCGGGCTG TCGTTTGGAC AGGGCGTGAT TGCCGCAGCG CGAGCGTTAA GTGAAGTGTA G
|
Protein sequence | MAIDTPSGVQ LRIRGKVQGV GFRPFVWQLA QQLQLHGDVC NDGDGVVVRL LEDPSQFIAA LYQDCPPLAR IDSVEHASLV WERAPRDFTI RQSAGGSMNT QIVPDAATCP ACLAEMNTPG ERRYRYPFIN CTHCGPRFTI IRAMPYDRPF TVMAAFPLCP ECDSEYRDPY DRRFHAQPVA CPSCGPHLEW RSQHERAEKE AALQAAVAQL NAGGIIAVKG LGGFHLACDA RNDNAVAMLR ARKHRPAKPL AVMLPTAQTL PTAARSLLTT PAAPIVLVDK QYIPSLSEGI APGLTEVGVM LPANPLQHLL LQELNYPLVM TSGNLSGRPP AITNEQALDD LHDIADGFLL HNRDIVQRMD DSVVRDSGEM LRRSRGYVPD AIALPPGFRD VPPILCLGAD LKNTFCLVRG EQAVVSQHLG DLSDDGIQAQ WREALRLIQS IYDFTPERIV CDAHPGYVSS QWASEMRLPT ETVLHHHAHA AACLAEHGWP LDGGEVIALT VDGIGMGENG ALWGGECLRV NYRECEHLGG LPAVALPGGD LAAKHPWRNL LAQCLRFVPD WQDYPETAGL QQQNWNVLAR AIERGVNAPL ASSCGRLFDA VAAALRCAPA SLSYEGEAAC ALEALASQCA NVEHPVTMPL NGAQLDVAVF WRQWLNWQAT PAQRAWAFHD ALACGFATLM RQQATARGIT TLVFSGGVIH NRLLRARLAF YLSDFKLLFP QRLPAGDGGL SFGQGVIAAA RALSEV
|
| |