Gene Moth_2179 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_2179 
Symbol 
ID3831649 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp2278031 
End bp2280334 
Gene Length2304 bp 
Protein Length767 aa 
Translation table11 
GC content55% 
IMG OID637830101 
Product(NiFe) hydrogenase maturation protein HypF 
Protein accessionYP_431011 
Protein GI83591002 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0068] Hydrogenase maturation factor 
TIGRFAM ID[TIGR00143] [NiFe] hydrogenase maturation protein HypF 


Plasmid Coverage information

Num covering plasmid clones39 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.474067 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTCAAA GAAGCAATGC AGAGAAAAAG GGGATTGTCC GCTATCGGAT AATCGTCAAA 
GGGATCGTCC AGGGCGTAGG GTTTCGCCCC TTTATTTATA ACCTGGCGAG GCTCTATCGC
CTGAAGGGCA CTGTTGCCAA TACCAGCCAG GGCGTGCTTA TCGAAGCCGA AGGCCAGGAG
GAGACAGTAC ACGCTTTTCT GGCCAGTTTA AAGGAAAAGC ATCCCCCGTT ATCGCGAATA
ACAGCTTTAG AATGGGACGT CCTGGAACCC TGTGGCTATA GTTCCTTTGC TATTATCCCC
AGTGGTGAAG GGCTCGGAAA AGAAGCCTTG ATTCCCCCTG ATGTAGCCCT GTGTGCTGAT
TGTGCCCGGG AAATAATGGA CCCCCGGGAC AGGCATTACG GCTACCCCTT TACCAACTGT
ACCAATTGTG GACCGCGATT TACCATTGTA CGGGGCGTAC CTTATGACCG TGCGAAAACC
TCCATGGCCC GGTTTCCGAT GTGTCCGGAA TGCGCCCGGG AATACCACGA TCCGGGCAAC
AGGAGATTCC ACGCCCAGCC GGCCGCCTGC CCGGCCTGCG GGCCGCAGGT GGAGCTGGTG
GACAGGCAGG GGCGAAAGGT GGAAGGCAAC TGGTTGGAGC TGAGCTGGAG ATTCTTGCAA
GACGGCAAGA TATTAGCCGT AAAAGGGCTG GGAGGCTTTC ACCTGGTATG CGATGCCAAG
AATAGGGCAG CTTTAAAGAC CCTCCGCCGG CGCAAGGGCC GTGAAGCCAA ACCCCTGGCG
GTAATGTGCC TCCTGGAGAC GGCCAGGAAG TACTGTTACG TTGGTCCGGA GGAAGAAAAA
CTCCTGTCTT CTCCTCAAGC ACCTATTGTT ATCCTTACCA AGCGGGCCGA CTGTAATTTG
CCGGATGAGC TGGCCCCGGG GATGAAGACC CTGGGTATAA TGTTACCTTA TACACCGCTG
CACCTGATGC TTTTAAACGG CCCCCTGGAA ATTTTAGTCA TGACCAGCGG TAATCGCAAC
GGTTTGCCCC TGGCAAAAGA CAACGGGAGA GCCCTGGAGG AACTGGGCGG TATAGCCGAC
TATTTTCTCT GGCACAATCG CGAGATTGTC AACCGCTGCG ACGATTCCGT CGTAGCGGTG
ATAGGTGATA CGGCTCAGAT TTTGCGCCGC TCGCGGGGAT ATGTCCCCTC GCCCGTTAAG
GTTGCGGTTA AATCCAGTTC CCCTGTGCTG GGCGCCGGTG GGGACATGAA AAACACCTTT
TGCCTTTTAA AAGGCAATCA AGCCTTTGTC AGCCAGCACA TCGGCGACCT GGGCAGCAGG
GAAGGTGAAG CACACTTTTT CGCCAGCCTG GAGAATTTAA AAAACCTCAT CGGCTCCGAA
CCTGAAGTGG TCGGTTATGA CATGCATCCC GGCTATCGTT CCTCCCGCCT GGCGGCAGGA
ATTCCGGCCA AGGCACACTT TGCCGTTCAA CACCATCACG CCCATATGGT TTCTTGCCTG
GCCGACAATG GCGTGGATGA GGATGCAATC GGGGTAATCC TTGATGGAAC CGGATACGGA
ACCGACGGAC GTCTCTGGGG TTTTGAGATC CTTACCGGGG ATTGTGCCGA TTTTACCAGG
GAGTATCACC TGGCCTATGT ACCGTTGCCC GGTGGCGAGC AGGCAGTACG TTACCCATGG
CGGACAGCAG TAGCCTATTT AATGAAATAC CTTTCGGCGC AAGGCGAGTC GCTTGCCGAC
CGCCTTTTTC AAAGCAGGGG GCAGGAGCTT GAGGTTATCA AACGTCTCGT CGCCACAGGC
TTTAATTCAC CTTTAAGCTC GAGCTGCGGT CGTCTTTTTG ACGCCGTATC GGCCCTCCTC
GGCCTCTGCT ACCATAATAG TTACGAAGGA CAGGCGGCAA TCGAGCTAGG AGAAATGGTC
CTGGACCCGG CGGAGGGCAA AAGATTAATA CCCTATCCTT TTTTTATCGA GGGGAAAGTT
ATCCATCCGG GCGGGGTCAT AGCCGGCGTG GCGGCCGACC TGGAACGGGG AGTTGCCAGG
GAGATTATTG CCACCCGTTT CCACAATACG GTCCTGGCGA TGGTACGCGA GGCGGTACGC
CGGGTTGCTG AAAGAACACA TATAAAAACT GTAGCCCTTA GCGGTGGTGC CTGGCAAAAC
CGCTATCTTT TCAGCCTTGC TAAAGAAATC TTGCCTGGCG ACGGTTATCG CCTGCTGGTC
CACCGGCAGG TACCGGCCAA TGATGGGGGG CTTTCCCTGG GCCAGGCAGT AATCGCTTGC
CGGAGGTGGC AACAATGTGT TTAG
 
Protein sequence
MAQRSNAEKK GIVRYRIIVK GIVQGVGFRP FIYNLARLYR LKGTVANTSQ GVLIEAEGQE 
ETVHAFLASL KEKHPPLSRI TALEWDVLEP CGYSSFAIIP SGEGLGKEAL IPPDVALCAD
CAREIMDPRD RHYGYPFTNC TNCGPRFTIV RGVPYDRAKT SMARFPMCPE CAREYHDPGN
RRFHAQPAAC PACGPQVELV DRQGRKVEGN WLELSWRFLQ DGKILAVKGL GGFHLVCDAK
NRAALKTLRR RKGREAKPLA VMCLLETARK YCYVGPEEEK LLSSPQAPIV ILTKRADCNL
PDELAPGMKT LGIMLPYTPL HLMLLNGPLE ILVMTSGNRN GLPLAKDNGR ALEELGGIAD
YFLWHNREIV NRCDDSVVAV IGDTAQILRR SRGYVPSPVK VAVKSSSPVL GAGGDMKNTF
CLLKGNQAFV SQHIGDLGSR EGEAHFFASL ENLKNLIGSE PEVVGYDMHP GYRSSRLAAG
IPAKAHFAVQ HHHAHMVSCL ADNGVDEDAI GVILDGTGYG TDGRLWGFEI LTGDCADFTR
EYHLAYVPLP GGEQAVRYPW RTAVAYLMKY LSAQGESLAD RLFQSRGQEL EVIKRLVATG
FNSPLSSSCG RLFDAVSALL GLCYHNSYEG QAAIELGEMV LDPAEGKRLI PYPFFIEGKV
IHPGGVIAGV AADLERGVAR EIIATRFHNT VLAMVREAVR RVAERTHIKT VALSGGAWQN
RYLFSLAKEI LPGDGYRLLV HRQVPANDGG LSLGQAVIAC RRWQQCV