Gene GM21_1130 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_1130 
Symbol 
ID8136452 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp1321703 
End bp1323979 
Gene Length2277 bp 
Protein Length758 aa 
Translation table11 
GC content67% 
IMG OID644868741 
Product(NiFe) hydrogenase maturation protein HypF 
Protein accessionYP_003020949 
Protein GI253699760 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0068] Hydrogenase maturation factor 
TIGRFAM ID[TIGR00143] [NiFe] hydrogenase maturation protein HypF 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones148 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGAGCG CGACGGAGCG TGCCGCCATC GAGATCGGCG GCATCGTGCA GGGGGTGGGT 
TTCCGCCCCT TCGTCTACCG GCTCGCCAAT CGCCTCGGCC TCTCCGGCTG GGTCCGCAAC
ACCGGGGAAG GCGTGCAGAT CGAGGTCGAA GGAGATGCAG CCGCCATCGA AGGATTTCGC
CGTGCCGTGC GCGAAGAGGC GCCCCCTTTG GCGGTGATCT GCCACCTGCG CGCGCGGCCG
GTTCCCGCGT TGAGAGAAAT CGGCTTCGCC ATCGTCGAGA GCGCCAGGAG CGCCGCCGGC
GGCGAGGTCT CCCCCGATTG CGATGTCTGC GACGATTGTC TTGCCGAACT CTTCGACCCT
GATAACCGTC GCCACGGCTA CCCGTTCATC AACTGCACCA ACTGCGGCCC GCGCTATTCC
ATCATCACCG GCATCCCTTA TGACCGCCCC GCCACCACCA TGGCAGGCTT CGTCATGTGC
GACGACTGCC TCGCCGAATA CCACGATCCC TGCAACCGGC GCTTCCACGC GCAACCCAAC
GCCTGCCCGG TCTGCGGACC GCGTTTGTCC CTTTTGGAGG CAAGCGGCGC ACCGGTCTGC
GGCGACGCGC TGCAGCTGAC CCTGGAGGCC CTGGCCCAAG GGAAGATCGT CGCTGTGAAG
GGGGTAGGAG GGTACCACCT GGCGGTGGAC GCCGCCAACC AAGCCGCGCT GGAGCGGCTC
AGGCAGCGCA AAAAGCGCGA CGAGAAGCCG TTTGCCATGC TGGCCGCCGA CCTCGACGCG
GTGCGCCGCC ACGCTCACTG CTCCGAGTTG GAGGGGCGGC TGCTGCTAGG GGTGGAGCGC
CCCGTCGTGC TGATGAGGAA ACTTCCCGGC ACCTCTATCA GCGACCTGGT CGCGCCCGGC
AACGGCTGGT TCGGGTTCAT GCTCCCGGGA AACCCGCTGC AGCACCTGCT GGCCGCAGGC
TCGCGCGCCC CCCTGGTAAT GACCAGCGGC AACCTTTCCG ATGAACCGAT CGCCTACCGG
GACGGCGAGG CGCTGGAGAT GCTCTCCGGT ATCGCGGACC TTTTCCTTGC CCACGACCGG
GAGATCCATA CGCGCACCGA CGATTCGGTG CTTCGCCTCT ACCGAGGGGA GCCGCTTTTC
CTGCGCCGCT CGCGCGGTTA CGTGCCGCGC GCCGTGCAAC TGCTGGCAGA GCAGGCAAGC
GTTCTCGCCG TCGGGGGCGA ACTGAAGTCG ACCCTTTGCC TCACCCGCGG GGACCGCGCT
TTCATGAGTC AGCATGTCGG CGATCTTAAG AATCCTGCGA CGCTGGCCTC GCTTAAGCAG
AGCGCGTCGG ACCTGCAGCG CCTACTAGAG ATAACGCCTG CGCTGGTGGC GCACGACCTG
CATCCGGACT ACCTTTCCAC CCATTACGCA GCGGCTCTCG GGCTCCCGGC CGTGGGAGTG
CAGCATCACC ACGCCCACAT GGCCTCCTGC ATGGCGGAGA ATGGGCTTAA CGGCGAGGTT
ATCGGGGTCA TCCTGGACGG CACCGGCTAC GGGGGCGACG GCACCATCTG GGGGGGCGAG
TTTCTTGTGG GCGGCTACTG CCACTTCGAA AGGCGCGCCC ACTTCGCCCA GATGAGGCTG
CCGGGGGGGG ACGCCGCCGT CAAGGAGCCG TACCGCATGG CGCTCTCGGT GCTCTACTCC
CTGCACGGCG GCCGGCTTTT CGATCAGCCG CTATCGGTTC TTTCCGAAGT GGCGCAGGCC
GACCGGCCGC TTTTTCTGAA GATGCTGGAG AAAGGAATCA ACTCACCGCT CACCTCGAGT
TGCGGCCGGC TTTTCGATGC AGTCTCCGCC TTGATCGGCG TGCGCAGCCG CATCAGCTAC
GAGGGGCAGG CGGCCATCGA GCTGGAGGCC CTGGCCGAAC AGGGGGGGGA AGTGGAGCCA
TACCCGTACC GGGTGCGGGA AGAGTGGGGG GCTGTGTTGG ATTTCACCCC CGCCATCGCC
GCCATATGCG CCGACCTTGC CCAAGGGAGG AACCGTGCCG ACATCGCGCG CGGCTTTCAC
ATAACCGTCG CCCGCGGCGT CCTTGACGTC TGCCGCAAGG TGCGTGAGGA GACCGGCTAC
GTGCGGGTTG TGCTCTCCGG CGGGGTATTC CAAAACCGGC TCTTGACCGA GGAGGTGGCG
GAGCTGCTGG CAGGCGACGC GTTCCAGGTC TACTGCCACC GGCTGGTCCC ACCGAACGAC
GGCGGCCTCG CCTTGGGCCA GGCGGCGATA GCAGGGGCGA TGCAGGCCCG CGGTTAG
 
Protein sequence
MQSATERAAI EIGGIVQGVG FRPFVYRLAN RLGLSGWVRN TGEGVQIEVE GDAAAIEGFR 
RAVREEAPPL AVICHLRARP VPALREIGFA IVESARSAAG GEVSPDCDVC DDCLAELFDP
DNRRHGYPFI NCTNCGPRYS IITGIPYDRP ATTMAGFVMC DDCLAEYHDP CNRRFHAQPN
ACPVCGPRLS LLEASGAPVC GDALQLTLEA LAQGKIVAVK GVGGYHLAVD AANQAALERL
RQRKKRDEKP FAMLAADLDA VRRHAHCSEL EGRLLLGVER PVVLMRKLPG TSISDLVAPG
NGWFGFMLPG NPLQHLLAAG SRAPLVMTSG NLSDEPIAYR DGEALEMLSG IADLFLAHDR
EIHTRTDDSV LRLYRGEPLF LRRSRGYVPR AVQLLAEQAS VLAVGGELKS TLCLTRGDRA
FMSQHVGDLK NPATLASLKQ SASDLQRLLE ITPALVAHDL HPDYLSTHYA AALGLPAVGV
QHHHAHMASC MAENGLNGEV IGVILDGTGY GGDGTIWGGE FLVGGYCHFE RRAHFAQMRL
PGGDAAVKEP YRMALSVLYS LHGGRLFDQP LSVLSEVAQA DRPLFLKMLE KGINSPLTSS
CGRLFDAVSA LIGVRSRISY EGQAAIELEA LAEQGGEVEP YPYRVREEWG AVLDFTPAIA
AICADLAQGR NRADIARGFH ITVARGVLDV CRKVREETGY VRVVLSGGVF QNRLLTEEVA
ELLAGDAFQV YCHRLVPPND GGLALGQAAI AGAMQARG