Gene Information |
Locus tag | Cphy_2056 |
Symbol | |
ID | 5743756 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium phytofermentans ISDg |
Kingdom | Bacteria |
Replicon accession | NC_010001 |
Strand | + |
Start bp | 2538743 |
End bp | 2540677 |
Gene Length | 1935 bp |
Protein Length | 644 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 641293153 |
Product | hydrogenase, Fe-only |
Protein accession | YP_001559163 |
Protein GI | 160880195 |
COG category | [R] General function prediction only |
COG ID | [COG4624] Iron only hydrogenase large subunit, C-terminal domain |
TIGRFAM ID | [TIGR02512] hydrogenases, Fe-only |
|
|
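The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS length includes the stop codon; the record does not state either convention, but both are consistent with the listed values. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Protein length in aa, assuming the CDS ends with one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(2538743, 2540677)
print(length)                  # 1935
print(protein_length(length))  # 644
```

Both results agree with the Gene Length (1935 bp) and Protein Length (644 aa) fields of this record.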
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0000410653 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATAACA ACAAAACTTT TATGTTAAAT TCTCCTTTAG GATCTATCTT TCCAGGAAAT CAAGAAGAAG GTCTAAAAGA AATATCCAAA AATGATAAAA ACCTAGTTGC AGTATCCGGT AAGATAAAGA AACCCGGTAT TGTTACTTTA ACAAGTAATA CGACTCTACG TGATATCATT GAACTTTCTG GTGGTATGTT AAATGATAAA CCTTTAAAAG CTGCACAGCT TGGTCTACCT TTTGGAGGTT TTTATACGAA GAAAGATATT GATCGTCAAA TAACTCCAGA CTCCTTTGAG GAAGGAATAA GTCATTCAAT TATTATTCTA TCGGAAGAGG ATTGTATCAT ACAGTATGCA AAGTTTTATA TTGACTTTTT GTTAGGTAAA ATCAATGGAG GCTATATCGA TCATTATATT CCCGCACTGC CAGAGATACT TCAGATGGTT AAAGTTTTAA ATCGTATTAG CAAGGGAAGA TCTAATATGA GAGATATTTT TCTATTAAGG GAACTCTCAC AAACAGTAAA AGAAAAGGTA CATCAAAAAT ATAATCTCAT CGAAGAAGTG ATAATCTACT TCTATGATGA GATTAAGGAC CATGTGGAAG ATAACCGATG CTACACCCTA CAGTGTAATA ATCTAACGAA ATTAACGATT ATGGACAACT GTATCGGATG TGATAAATGT ACAAAAGTAT GTCCGGTGGA CTGTATTGTT GGAGACTTTA AAGAACAGCA TTATATTGAT TACACAAGGT GTACCCATTG TGGTGCTTGT CTTAGCACCT GTCCTGTAAA TGCTATAACG TCAGGAAATA ATTCTATCTT ATTTTTAAGA GATTTAGCAA CACCAAATAA AATTGTAATT ACACAAATGG CTCCTTCCGT TCGTGTTGCA ATCGGTGAGG CATTTGGATT TGAAACCGGT GCTAATGTAG AAAAAAAGAT TGCAGCAGGG CTTCGAAAAC TAGGTGTTGA TTATGTATTC GATACCACCT GGGCAGCCGA CTTAACGATT ATGGAAGAAG CGAATGAACT TGCAGAAAGG GTGAAGAGAT ATCTAGAAGG CGATAAGGAA GTAAAACTCC CAATCCTTAC ATCCTGTTGC CCAGCTTGGG TAAAATTCAT TGAACAGAAC TACGGAGATA TGTTAGATGT GCCCTCCACT TCAAAATCAC CGATGCAAAT GTTTGCTGCA GTTGCAAAAG ACCTATGGGG TAAGGAGAAA GGACTTAGCA GAGAACAGAT TACTTCCGTC GCTATTATGC CATGCATCGC GAAAAAATAT GAAGCTTCAA GGCCTGAGTT TTCCAGGGGG CTTAATTATG ATGTGGATTA CGTTATTACA ACTACAGAGC TCATAGACAT CTTTAAGAAA TCTAATATTG ACTTAAGTCA GTTAGAAGAT GAAGAAATCG ATCAGGTGAT GGGAGAATAT ACCGGAGCAG GTATTATTTT CGGCCGAACC GGCGGTGTAA TTGAAGCTGC TACTAGAACT GCGGTTGAAT TAATCACAGG GCAGCGTGTT GATAACATTG AATTTACCAG TCTACGTGGC TTTGATGGAT TTCGAAGCTG TGAATTAACG GTAGGAGACT TAACATTACG AATTGGTGTA ACCTATGGAC TTAAGGAAGC CAGAAAGATG TTAGATAAGA TTCGCTCAGG AGAGGAATTT TACCATGCAA TAGAAATTAT GGCATGTACC GGTGGCTGTA TCGGAGGCGG CGGACAACCT AGGGCTAAAA AAAGAGAGGA AACAATTAAA AGTAGGATGG AGGGTCTGAA TGAAATTGAT 
CGTTCCTTAC CACTTCGCAG ATCCAATGAT AACCCAGCGG TACTCGCAAT TTATGAAAAA TTCTTAGATT ATCCGCTTAG TAATAAGGCT CAGGAATTAC TTCATACCAG ATATTTTGTT AAGAGGAAAA AATAA
|
Protein sequence | MNNNKTFMLN SPLGSIFPGN QEEGLKEISK NDKNLVAVSG KIKKPGIVTL TSNTTLRDII ELSGGMLNDK PLKAAQLGLP FGGFYTKKDI DRQITPDSFE EGISHSIIIL SEEDCIIQYA KFYIDFLLGK INGGYIDHYI PALPEILQMV KVLNRISKGR SNMRDIFLLR ELSQTVKEKV HQKYNLIEEV IIYFYDEIKD HVEDNRCYTL QCNNLTKLTI MDNCIGCDKC TKVCPVDCIV GDFKEQHYID YTRCTHCGAC LSTCPVNAIT SGNNSILFLR DLATPNKIVI TQMAPSVRVA IGEAFGFETG ANVEKKIAAG LRKLGVDYVF DTTWAADLTI MEEANELAER VKRYLEGDKE VKLPILTSCC PAWVKFIEQN YGDMLDVPST SKSPMQMFAA VAKDLWGKEK GLSREQITSV AIMPCIAKKY EASRPEFSRG LNYDVDYVIT TTELIDIFKK SNIDLSQLED EEIDQVMGEY TGAGIIFGRT GGVIEAATRT AVELITGQRV DNIEFTSLRG FDGFRSCELT VGDLTLRIGV TYGLKEARKM LDKIRSGEEF YHAIEIMACT GGCIGGGGQP RAKKREETIK SRMEGLNEID RSLPLRRSND NPAVLAIYEK FLDYPLSNKA QELLHTRYFV KRKK
|
| |
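The gene and protein sequences above are related through translation table 11. A minimal sketch of that translation follows, using only the Python standard library. It relies on the fact that table 11 assigns the same amino acids to codons as the standard code and differs mainly in its permitted start codons; alternative start codons are not handled here, which is an assumption that holds for this gene because it begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon-to-amino-acid assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 15 bases of the gene sequence in this record.
print(translate("ATGAATAACA ACAAAACTTT TATGTTAAAT"))  # MNNNKTFMLN
print(translate("AAGAGGAAAA AATAA"))                  # KRKK
```

The two fragments reproduce the first ten and last four residues of the listed protein sequence.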