Gene Cphy_2056 details

Gene Information

Locus tag: Cphy_2056
Symbol:
ID: 5743756
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium phytofermentans ISDg
Kingdom: Bacteria
Replicon accession: NC_010001
Strand:
Start bp: 2538743
End bp: 2540677
Gene Length: 1935 bp
Protein Length: 644 aa
Translation table: 11
GC content: 38%
IMG OID: 641293153
Product: hydrogenase, Fe-only
Protein accession: YP_001559163
Protein GI: 160880195
COG category: [R] General function prediction only
COG ID: [COG4624] Iron only hydrogenase large subunit, C-terminal domain
TIGRFAM ID: [TIGR02512] hydrogenases, Fe-only
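
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions on the replicon and that the coding sequence ends with a single stop codon that is not counted in the protein length. Neither convention is stated in the record, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming one terminal stop codon that is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2538743, 2540677)
print(length)                  # 1935
print(protein_length(length))  # 644
```

Under these assumptions the values agree with the Gene Length and Protein Length fields listed in the record.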


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000410653
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATAACA ACAAAACTTT TATGTTAAAT TCTCCTTTAG GATCTATCTT TCCAGGAAAT 
CAAGAAGAAG GTCTAAAAGA AATATCCAAA AATGATAAAA ACCTAGTTGC AGTATCCGGT
AAGATAAAGA AACCCGGTAT TGTTACTTTA ACAAGTAATA CGACTCTACG TGATATCATT
GAACTTTCTG GTGGTATGTT AAATGATAAA CCTTTAAAAG CTGCACAGCT TGGTCTACCT
TTTGGAGGTT TTTATACGAA GAAAGATATT GATCGTCAAA TAACTCCAGA CTCCTTTGAG
GAAGGAATAA GTCATTCAAT TATTATTCTA TCGGAAGAGG ATTGTATCAT ACAGTATGCA
AAGTTTTATA TTGACTTTTT GTTAGGTAAA ATCAATGGAG GCTATATCGA TCATTATATT
CCCGCACTGC CAGAGATACT TCAGATGGTT AAAGTTTTAA ATCGTATTAG CAAGGGAAGA
TCTAATATGA GAGATATTTT TCTATTAAGG GAACTCTCAC AAACAGTAAA AGAAAAGGTA
CATCAAAAAT ATAATCTCAT CGAAGAAGTG ATAATCTACT TCTATGATGA GATTAAGGAC
CATGTGGAAG ATAACCGATG CTACACCCTA CAGTGTAATA ATCTAACGAA ATTAACGATT
ATGGACAACT GTATCGGATG TGATAAATGT ACAAAAGTAT GTCCGGTGGA CTGTATTGTT
GGAGACTTTA AAGAACAGCA TTATATTGAT TACACAAGGT GTACCCATTG TGGTGCTTGT
CTTAGCACCT GTCCTGTAAA TGCTATAACG TCAGGAAATA ATTCTATCTT ATTTTTAAGA
GATTTAGCAA CACCAAATAA AATTGTAATT ACACAAATGG CTCCTTCCGT TCGTGTTGCA
ATCGGTGAGG CATTTGGATT TGAAACCGGT GCTAATGTAG AAAAAAAGAT TGCAGCAGGG
CTTCGAAAAC TAGGTGTTGA TTATGTATTC GATACCACCT GGGCAGCCGA CTTAACGATT
ATGGAAGAAG CGAATGAACT TGCAGAAAGG GTGAAGAGAT ATCTAGAAGG CGATAAGGAA
GTAAAACTCC CAATCCTTAC ATCCTGTTGC CCAGCTTGGG TAAAATTCAT TGAACAGAAC
TACGGAGATA TGTTAGATGT GCCCTCCACT TCAAAATCAC CGATGCAAAT GTTTGCTGCA
GTTGCAAAAG ACCTATGGGG TAAGGAGAAA GGACTTAGCA GAGAACAGAT TACTTCCGTC
GCTATTATGC CATGCATCGC GAAAAAATAT GAAGCTTCAA GGCCTGAGTT TTCCAGGGGG
CTTAATTATG ATGTGGATTA CGTTATTACA ACTACAGAGC TCATAGACAT CTTTAAGAAA
TCTAATATTG ACTTAAGTCA GTTAGAAGAT GAAGAAATCG ATCAGGTGAT GGGAGAATAT
ACCGGAGCAG GTATTATTTT CGGCCGAACC GGCGGTGTAA TTGAAGCTGC TACTAGAACT
GCGGTTGAAT TAATCACAGG GCAGCGTGTT GATAACATTG AATTTACCAG TCTACGTGGC
TTTGATGGAT TTCGAAGCTG TGAATTAACG GTAGGAGACT TAACATTACG AATTGGTGTA
ACCTATGGAC TTAAGGAAGC CAGAAAGATG TTAGATAAGA TTCGCTCAGG AGAGGAATTT
TACCATGCAA TAGAAATTAT GGCATGTACC GGTGGCTGTA TCGGAGGCGG CGGACAACCT
AGGGCTAAAA AAAGAGAGGA AACAATTAAA AGTAGGATGG AGGGTCTGAA TGAAATTGAT
CGTTCCTTAC CACTTCGCAG ATCCAATGAT AACCCAGCGG TACTCGCAAT TTATGAAAAA
TTCTTAGATT ATCCGCTTAG TAATAAGGCT CAGGAATTAC TTCATACCAG ATATTTTGTT
AAGAGGAAAA AATAA
 
Protein sequence
MNNNKTFMLN SPLGSIFPGN QEEGLKEISK NDKNLVAVSG KIKKPGIVTL TSNTTLRDII 
ELSGGMLNDK PLKAAQLGLP FGGFYTKKDI DRQITPDSFE EGISHSIIIL SEEDCIIQYA
KFYIDFLLGK INGGYIDHYI PALPEILQMV KVLNRISKGR SNMRDIFLLR ELSQTVKEKV
HQKYNLIEEV IIYFYDEIKD HVEDNRCYTL QCNNLTKLTI MDNCIGCDKC TKVCPVDCIV
GDFKEQHYID YTRCTHCGAC LSTCPVNAIT SGNNSILFLR DLATPNKIVI TQMAPSVRVA
IGEAFGFETG ANVEKKIAAG LRKLGVDYVF DTTWAADLTI MEEANELAER VKRYLEGDKE
VKLPILTSCC PAWVKFIEQN YGDMLDVPST SKSPMQMFAA VAKDLWGKEK GLSREQITSV
AIMPCIAKKY EASRPEFSRG LNYDVDYVIT TTELIDIFKK SNIDLSQLED EEIDQVMGEY
TGAGIIFGRT GGVIEAATRT AVELITGQRV DNIEFTSLRG FDGFRSCELT VGDLTLRIGV
TYGLKEARKM LDKIRSGEEF YHAIEIMACT GGCIGGGGQP RAKKREETIK SRMEGLNEID
RSLPLRRSND NPAVLAIYEK FLDYPLSNKA QELLHTRYFV KRKK
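
The relationship between the two sequence blocks above can be illustrated with a minimal sketch. It strips the spacing used in the 10-character groups, computes a GC fraction, and translates codons until the first stop. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. The codon table is built from the standard genetic code, on the understanding that translation table 11 uses the same amino acid assignments and differs only in which codons may act as start codons. This gene begins with ATG, so that difference should not matter here.

```python
from itertools import product

# Standard amino acid assignments with bases ordered T, C, A, G.
# Translation table 11 is assumed to share these assignments.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First and last lines of the gene sequence as listed in this record.
first_line = "ATGAATAACA ACAAAACTTT TATGTTAAAT TCTCCTTTAG GATCTATCTT TCCAGGAAAT"
last_line = "AAGAGGAAAA AATAA"
print(translate(first_line))  # MNNNKTFMLNSPLGSIFPGN
print(translate(last_line))   # KRKK
```

The two printed fragments match the first 20 and last 4 residues of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field in the record.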