Gene MCA1604 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMCA1604 
SymbolhypF 
ID3104635 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylococcus capsulatus str. Bath 
KingdomBacteria 
Replicon accessionNC_002977 
Strand
Start bp1710851 
End bp1713115 
Gene Length2265 bp 
Protein Length754 aa 
Translation table11 
GC content70% 
IMG OID637170772 
Product[NiFe] hydrogenase maturation protein HypF 
Protein accessionYP_114054 
Protein GI53804353 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0068] Hydrogenase maturation factor 
TIGRFAM ID[TIGR00057] Sua5/YciO/YrdC/YwlC family protein
[TIGR00143] [NiFe] hydrogenase maturation protein HypF 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGGACCG CCCAGCGCAT CCAGGTCCGC GGCCGTGTGC AAGGCGTGGG GTTCCGCCCC 
TTCGTCTGCC GGCTCGCCCG CGAATTCGGC CTCGCCGGCT GGGTGAGGAA CCGCGGCGGC
GGCGTCGAAA TCCACGCCGA AGGCGATCCA GCCGCGATCG AGCGGCTCAT CGACGCACTG
GTCTCCCGTG CCCCGCCGCT GGCCGATCCG CAGCCGCCCG TTCACCGGCC CGCCGAATTC
CGGGGCTACG CCGGGTTCGA AATCCTGCTC AGCGATTCCG GGGACAAAAC GCCCATCCAC
GTGCCGCCGG ACCACTTCGT CTGCCCCGAT TGCCTGGACG AAATGCGAGA CCCCGCCGCC
CGCCGCTACC GCTACCCCTT CATCAACTGC ACCCAATGCG GGCCGCGCTA CACCCTCATC
GACCGGCTGC CCTACGACCG CCCGAACACC GCGATGGCGG ACTTCCCGCT GTGCCAGGAC
TGCCGGCGCG AGTACGAAGA CATTCACGAC CGGCGCTACC ACGCCCAGCC GCTGGCCTGT
CCTCTGTGCG GGCCGGTGCT GGAATTTCGC GAGCTGTCCG CATGCATCGC CATACCGGGT
AACGAACCGG CGCTCTCGGG CTGCATCGAA GCCCTACGGC AAGGCCAGGT CGTCGCCGTC
AAAGGCGTCG GCGGCTACCA CCTCCTCTGC GACGCCCGCT CGGATGCCGC CGTCGGGCGC
CTGCGCGAAC GCAAACGCCG CCCGGACAAA CCCCTCGCGG TGCTGATCCC CTGGTTCGAA
GGCGAAGGGG TCGATTGGCT GGCACGCCTG GCCGAGCCGC GCCCTGACGA ACGCGAACTG
CTGGCATCCC CGCTCCGACC CATCGTCATC GTGCAACGCT CCGTCGACAG CGATCTGTCC
GGCCTCATCG CGCCGGGCCT GGACGAGATC GGCCTGATGT ACCCCTACAG CCCGCTGCAT
CACCTGCTGG CGGGCGACTA TGGCGCGCCG CTGGTCGCCA CTTCGGCCAA CCTCAGCGGC
GAGCCGGTGC TGACCGACGG CGCGGAAGTC GAGCGCCAGC TGGCTCACGT CGCCGATGCC
TTCCTGCATC ATGACCGGCC GATCCGCCGG CCCGCCGACG ACTCGGTTTA CCGCCGCAGC
GCCGGGAGTA TGCGGCCCTT GCGGCTGGGT CGCGGTACGG CGCCGCTCGA AATGCCTCTC
CGCTACCCGG TGGCCGAACC CACTCTTGCG CTCGGGGCCG ATCTCAAGAA CACCATTGCC
CTGGCCTTCG AAGACCGCGC GGTCGTCTCC CCGCACCTCG GCAATCTCGG CGCGCCGCGC
AGCCTCGACG TATTCGGGCA ACTGATTCGG GAGCTGTCCG CCTTGTACGG GATCGCGCCG
AAGCGGGTGG TCTGCGACGC TCACCCCGAC TACTTCTCCA GTCGCTGGGC CCGGGCCTGC
GGGCTCGAAA TTCACCGCGT CCTCCACCAC CATGCACACG CCTCGGCGCT TTACGGCGAG
TTCGGGCCGG ACGGCGACAT CCTGGTGTTC GCCTGGGACG GCACCGGCTT CGGCGGCGAC
GGCACCCTGT GGGGCGGCGA AACCCTGCTC GGCCGGCCTG GCGGCTGGCG CCGGGTCGGA
AGCCTGCGGC CGTTCCGGCT CATCGGAGGC GAAAAGGCCA GCGCCGAGCC CTGGCGCTGC
GCGCTCGCGG CCTGCTGGGA AGCGGGGCTG GACTGGTCCG GCTGCCCGGT GGACCCCGCG
CCGTATCGGC ACGTCTGGGA ACGCGGGATC AACAGTCCCT ACACGAGCTC CGCCGGACGT
CTGTTCGACG CCGCCGCCGC CCTGATCGGC GTCACGCTGA ATCAAAGCCA CGAAGGCCAG
GCCGGTATGC GGCTGGAAGC GCTGGCCGGC GACACGGCGG ATTTCATCGA ACTGCCGGCG
CGGCGTGAGA ACGGCCTGTA CCGCATCGAC TGGAGCCCTC TGCTGCCCGC TTTGATGGAC
GGGAGCCAAC CGGCCGGCTA CCGCAGCGCC CTGCTCCATG CCAGCCTGGC CCACGCGGTT
CTGGCCCAGG CCCGAGCGAT CCGCGGCGAA TCGGGCGTAA ACCTAGCCGG CCTCACCGGC
GGCGTGTTCC AGAACCGTAT TCTCGCCGAA CTGACCGCGA ATCTGCTGCG GGCCGACGGA
TTCGAAGTCG TCCTGCCCGC CAGCCTTCCT GTCAACGATG CCGCGATCGC CTACGGCCAG
CTCGTCGAGA CCGCCGGCGC CGGCCGGGCG CCCAGCCCCG CTTGA
 
Protein sequence
MRTAQRIQVR GRVQGVGFRP FVCRLAREFG LAGWVRNRGG GVEIHAEGDP AAIERLIDAL 
VSRAPPLADP QPPVHRPAEF RGYAGFEILL SDSGDKTPIH VPPDHFVCPD CLDEMRDPAA
RRYRYPFINC TQCGPRYTLI DRLPYDRPNT AMADFPLCQD CRREYEDIHD RRYHAQPLAC
PLCGPVLEFR ELSACIAIPG NEPALSGCIE ALRQGQVVAV KGVGGYHLLC DARSDAAVGR
LRERKRRPDK PLAVLIPWFE GEGVDWLARL AEPRPDEREL LASPLRPIVI VQRSVDSDLS
GLIAPGLDEI GLMYPYSPLH HLLAGDYGAP LVATSANLSG EPVLTDGAEV ERQLAHVADA
FLHHDRPIRR PADDSVYRRS AGSMRPLRLG RGTAPLEMPL RYPVAEPTLA LGADLKNTIA
LAFEDRAVVS PHLGNLGAPR SLDVFGQLIR ELSALYGIAP KRVVCDAHPD YFSSRWARAC
GLEIHRVLHH HAHASALYGE FGPDGDILVF AWDGTGFGGD GTLWGGETLL GRPGGWRRVG
SLRPFRLIGG EKASAEPWRC ALAACWEAGL DWSGCPVDPA PYRHVWERGI NSPYTSSAGR
LFDAAAALIG VTLNQSHEGQ AGMRLEALAG DTADFIELPA RRENGLYRID WSPLLPALMD
GSQPAGYRSA LLHASLAHAV LAQARAIRGE SGVNLAGLTG GVFQNRILAE LTANLLRADG
FEVVLPASLP VNDAAIAYGQ LVETAGAGRA PSPA