Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_3972 |
Symbol | |
ID | 8139346 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 4555684 |
End bp | 4557105 |
Gene Length | 1422 bp |
Protein Length | 473 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 644871588 |
Product | nickel-dependent hydrogenase large subunit |
Protein accession | YP_003023746 |
Protein GI | 253702557 |
COG category | [C] Energy production and conversion |
COG ID | [COG3259] Coenzyme F420-reducing hydrogenase, alpha subunit |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 41 |
Fosmid unclonability p-value | 0.00000529156 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCCGATGA AGCGAACTTT GAAGATTGAC CCAGTGACCC GGATCGAGGG GCACGCCAAG GTGTTCATCA ACCTGGACGA GGCCGGGGCG CTGGAAAGCG CGGGGCTCGT GGTGAACGAG CTGCGCGGCT TCGAGAAGAT CCTGATCGGG ATGGAGGCGG ACCGGATGCC GCACGTGACG GCGCGCATCT GCGGGGTCTG CCCCACCGCG CACCATATCG CCGCCTGCAA CGCACTGGAC CACGCGGCAG GGGTCACGCC CCCCCCCGCC GCGCTTCTTC TGCGGGAGCT GATGTATCTC GGGCACATCA TCCACTCGCA CTCGCTTTCC ATCTTCGTGC TGCAGGGGCC GGACCTGGTG CTGGGACTCG ACGCGGACCC GGCGATCCGG AACATCGTGG GGATCGTGCA GGCGAACCCG GAGCTCGCCA AACTCGCCCT GCGCCTGAGG AGCATCGGCC AGAAGATCAA CGAGATGGTG GGGGGGAGGG GGACGCACCC GGTGACCTCG GTGGCCGGCG GCATCGCCTT CGTGCTCGAC AAGGAGAAGC TAAAGGCGCT CAAGGAGTGG GTGGACGAGG CGCGGGGGGT GCTGCCGCAG GTGGTCCCGG CCGTCAAGGG GCTCCTGATG CGGGCCCTGG AAGCGCACCC GGAGATGGGA GAAAAATGGA TCGTCCCGAG TTTCGGGATG GGTACCGTGC AGGATGGAGC GGTCTCCCTG ATCTCGGGGG AGCTTCGCGT CATCGACGAC ACCGGCGCCA CCGTTTTGGA GTTCGGGATC GAGGAGTACG ACCGGTACCT GCGCGAGTCT GTCGTCGAGT GGTCCTACAT GAAGAAGGTG CAGGTAGAGC TGGACGGCGA GCTGCACGAC TACCGGGTGG GGCCCATGGC GCGGATGAAC GTGGCGCGCC GTTTCGGGAC CGAAATGGCC GACGCCGAGT ACGCCGAGTT CGCCAGGTTG GGGGGAGCCC CCTGCCACAC CACCGTGTTC CAGACCTACG CCAAGCTGAT CGAGATCGTC TGGGCCATCG AGCGGGCGGG GGAGATCCTG CGCGACAAGG CGATCCGCGG GGAGACCCGG GTCCCGGTCC GCTTCCAGGG GGGGAGGGGG GTGGGGCACG TCGAGGCGCC GCGCGGCACG CTGATCCACG ACTACCAGAT CGACGAGCGC GGGATCGTGC GGGCGGCGAA CCTGATCGTC GCCACCCAGC AGAACTACTC GCTCATCAAC CGCTCCATCG AGCAGTCCGC CCAGTCCCAC GTGATCGACC GCCCCGACGA CCGGGCGCTG ATGAACGCCG TCGAGTTCAG CATCCGCTGC TACGACCCCT GCCTCTCCTG CGCCACCCAC GCTCTCGGGC GGATGCCGCT GGAGGTAGCG GTCAGGCGGG GCGCGGAGAC GGTCAAGACC CTTTGGAGGT AA
|
Protein sequence | MPMKRTLKID PVTRIEGHAK VFINLDEAGA LESAGLVVNE LRGFEKILIG MEADRMPHVT ARICGVCPTA HHIAACNALD HAAGVTPPPA ALLLRELMYL GHIIHSHSLS IFVLQGPDLV LGLDADPAIR NIVGIVQANP ELAKLALRLR SIGQKINEMV GGRGTHPVTS VAGGIAFVLD KEKLKALKEW VDEARGVLPQ VVPAVKGLLM RALEAHPEMG EKWIVPSFGM GTVQDGAVSL ISGELRVIDD TGATVLEFGI EEYDRYLRES VVEWSYMKKV QVELDGELHD YRVGPMARMN VARRFGTEMA DAEYAEFARL GGAPCHTTVF QTYAKLIEIV WAIERAGEIL RDKAIRGETR VPVRFQGGRG VGHVEAPRGT LIHDYQIDER GIVRAANLIV ATQQNYSLIN RSIEQSAQSH VIDRPDDRAL MNAVEFSIRC YDPCLSCATH ALGRMPLEVA VRRGAETVKT LWR
|
| |