Gene Acid345_3003 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_3003 
Symbol 
ID4071558 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp3555657 
End bp3558701 
Gene Length3045 bp 
Protein Length1014 aa 
Translation table11 
GC content61% 
IMG OID637985022 
ProductFe-S-cluster-containing hydrogenase 
Protein accessionYP_592078 
Protein GI94970030 
COG category[C] Energy production and conversion 
COG ID[COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing
[COG0437] Fe-S-cluster-containing hydrogenase components 1 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00213363 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGATAACG GATCAAAGAA GAACGGCGCG GACGTTTGCC CCAGCAAGAA GGGCAAGCTC 
GAACTCGCCG ACGTGAAACA GCAGTTGGCG GCGGCCAAGG ACGGCCCGCA ATATTGGCGC
AGCCTCGATG AACTCTCCAA TACGGATGAG TTCCAGGAAA TGCTGCACCG CGAATTTCCG
CGGCAGGCCT CGGAGTGGGT AGACGACGGT GGCAGTTCCC GCCGCGACTT CCTCAAGCTG
ATGAGCGCTT CGTTGGCGCT CGCCGGACTT ACCGCCTGTA CCAAGCAGCC GATCGAGCCG
ATCGTTCCTT ACGTTCGCCA GCCCGAAGAA TTGACCCTTG GCAAGCCCCT CTTCTTCGCA
ACCGCGAACA CCGTCGGCGG CTACGCCGTG CCGGTTCTCG CGGAAAGCCA TGAAGGTCGG
CCAACTAAGC TGGAAGGGAA CCCGCAGCAC CCCGCGACGC TCGGCGGTAC CGATGTCTTT
ACTCAGGCCT CGGTTCTCAC CATGTACGAT CCCGACCGCT CGCAGGTCGT AATGCTCGAT
AACGAGATCC GCACCTGGGG CTCGTTTGTC GGTGCCGTTG CGAATCCGCT GGCCGCGCAG
AAGGCCGTGC AGGGCGCTGG ACTTCGACTT CTCACTCGCT CGACCACATC GCCAACGCTT
GGCGCGCAGA TCAAGCAGCT TCTGCAGACT TATCCGCAGG CAAAGTTGGT GCAGTACGAC
CCGGCGGGTC GCGACAACGC TCGCGCTGGT TCGCAACTTG CCTTCGGTCA GTACGTCGAG
ACGCAGTACA ACCTCGACAA GGCCGACATA ATTCTTTCGC TCGATGGCGA TTTCCTCTCC
AGCGGATTCC CCGGCTTTCA CAAGTACGCC CGCAACTTCT CGCAGCGCCG CCAGCCCGAC
CTCAAAGAGA AAATGGTTCG GTTCTACATG GCGGAGAGCA CGCCGACCAA CACCGGCGGC
AAGGCCGATC ACCGCATCCC GATGCGCGCC TCCGATGTCG AACAATTCGG ACGTGCCATC
GCCGCGGGCA TCGGAGTAGC TGGCGCTGGT GGTTCAGCAA AGCAGGAGTG GCAAAACCAG
GTTGCCGCAA TAGTCTCGGA TCTCAACAAG CACAAGGGCG CCGCCGTCGT CGTGGTCGGT
GAGCATCAAC CACCCGCGGT TCATGCTCTC GCGCACTCCA TGAATGCCGC TCTCGGCGCG
GTTGGCACGA CCGTTACGTA TACCGAGCCG ATCGAACAGA TTCCCGCGGA TCAAACTGCC
GGCCTCAAGG AACTCGTCGC CGACATGAAC TCCGGCAAGG TGGACTTGCT GGTCGTCATG
GGCGCGAACC CGGTATACGA AGCCCCCGCC GACCTCGCCT TCCTCGACGC CTTTAAGAAA
GTCGCGGTCC GCATCCATCA CGGCCTCTAC GTCGACGAAA CCGCGGTCTT GTCGCACTGG
CACATCAACG GTACGCACTT CCTCGAGCAG TGGGGCGATG TTCGTGCCTT CGACGGCACC
GTCACCATCC AGCAACCGCT GATTGCTCCG CTCTACAACG GCAAGAGCCA GTACGAATTC
GTCGCCGCGC TCAACGGGCA AGGTTCCACC AGCGGCTATG AACTGGTAAA GGGCACGTGG
CAGAAGCAGC ACACGGGCGC CGATTTCGAA GCCTGGTGGC GCAAGGCTGT GCACGATGGC
CTCATCGCCG GCACCGCCGC ACCCGCAAAA ACTGTCAGCG CGAAGGGCGC TCCCGCCGCG
ACGAACGCCG CCAGCGACAG CGCGATGGAG CTCATCTTCC GCCGCGATCC CATGATTTAC
GACGGCGAAT ACTCCAACAA CGGCTGGCTC CAGGAAGCTC CGAAGCCGAT CACGCAGCTC
ACTTGGGACA ATCCCATCGA GATGAACGTG ACCCAGGCGG AGCAGATGGG AATCAAGACC
GAGGACGAAC TCGAGATCAC CGTCGATGGC CGCAAGATCG TTGGCGGCGC TTGGCTCACG
CCCGGTCACC CTAAGAATTC AGTCACTGTC TTCCTGGGCT ATGGCCGAAC GCGCGCTGGC
CGAGTGGGCA CTGGCACAGG GTACAACGCC TATCAGGCCC GCACCTCCGA CAAACAGTGG
ATCGTGAATG GCGTCCAGAT CGCGAAGACC GGCAAGAAGT TCCTCTTCGC CACCACGCAA
GGCTGGCAAA ACATGGATGG CCGCGACCTG GTTCGCGTCG CCACCCTCGA AGACTTCATT
GCCAATCCCG AGTTCGCGCA CGAAAAGACG GAAGCTCCAG TCGAAGGGCT CACCATCTTC
CAGCCCTACG ACTACAGCGA AAAGCCGGGT GAGACTCGCT ACAAGTGGGG CATGGCGATT
GATCTCAACT CCTGCATTGG TTGCAAGAGC TGCGTCGTCG CTTGCGTCTC TGAGAACAAC
ATCCCGGTCG TTGGCAAGGA ACTCGTTAAA CGCGGCCGCC ACATGCACTG GCTCCGCGTC
GACAACTATC ACGAGGGCTC GCCCGACGAT CCCAAGACCT ACTACCAGCC GGTGCCTTGC
CAGCAATGCG AGAACGCGCC CTGCGAGTTG GTCTGCCCGG TCGGCGCCAC CGTTCACAGC
AGTGAAGGCC TGAACGACAT GGTCTACAAC CGCTGCGTGG GCACGCGTTA TTGTTCGAAC
AATTGCCCAT ACAAGGTGCG TCGCTTCAAC TTCCTGCTTT ATCAAGATTG GGAAACGCCA
CAGTACAAGA TGATGCGCAA TCCGGATGTC TCGGTGCGCA GCCGTGGCGT GATGGAGAAG
TGCAACTACT GCGTGCAGCG CATTACGCAC GCCCGCATCA ACTCTGAGCG CGATGGGCGC
CGCATTGCGG ATGGCGAATT CACCACCGCG TGCGCGCAGG CGTGCCCGGC GAGCGCTATC
ACCTTCGGCG ATCTCAACGA TCCCAATAGC CAGGTAGCCA AGCTTCGCGC GCAGCAGCGC
AATTACGGAT TGCTGGAAGA CTTGAACAAC CGTCCGCGCA CCACATATAT GGCGGTGGTC
CGCAACCCGA ACCCTGAACT CGAGCATGCC ATGGAGCGGA AGTAA
 
Protein sequence
MDNGSKKNGA DVCPSKKGKL ELADVKQQLA AAKDGPQYWR SLDELSNTDE FQEMLHREFP 
RQASEWVDDG GSSRRDFLKL MSASLALAGL TACTKQPIEP IVPYVRQPEE LTLGKPLFFA
TANTVGGYAV PVLAESHEGR PTKLEGNPQH PATLGGTDVF TQASVLTMYD PDRSQVVMLD
NEIRTWGSFV GAVANPLAAQ KAVQGAGLRL LTRSTTSPTL GAQIKQLLQT YPQAKLVQYD
PAGRDNARAG SQLAFGQYVE TQYNLDKADI ILSLDGDFLS SGFPGFHKYA RNFSQRRQPD
LKEKMVRFYM AESTPTNTGG KADHRIPMRA SDVEQFGRAI AAGIGVAGAG GSAKQEWQNQ
VAAIVSDLNK HKGAAVVVVG EHQPPAVHAL AHSMNAALGA VGTTVTYTEP IEQIPADQTA
GLKELVADMN SGKVDLLVVM GANPVYEAPA DLAFLDAFKK VAVRIHHGLY VDETAVLSHW
HINGTHFLEQ WGDVRAFDGT VTIQQPLIAP LYNGKSQYEF VAALNGQGST SGYELVKGTW
QKQHTGADFE AWWRKAVHDG LIAGTAAPAK TVSAKGAPAA TNAASDSAME LIFRRDPMIY
DGEYSNNGWL QEAPKPITQL TWDNPIEMNV TQAEQMGIKT EDELEITVDG RKIVGGAWLT
PGHPKNSVTV FLGYGRTRAG RVGTGTGYNA YQARTSDKQW IVNGVQIAKT GKKFLFATTQ
GWQNMDGRDL VRVATLEDFI ANPEFAHEKT EAPVEGLTIF QPYDYSEKPG ETRYKWGMAI
DLNSCIGCKS CVVACVSENN IPVVGKELVK RGRHMHWLRV DNYHEGSPDD PKTYYQPVPC
QQCENAPCEL VCPVGATVHS SEGLNDMVYN RCVGTRYCSN NCPYKVRRFN FLLYQDWETP
QYKMMRNPDV SVRSRGVMEK CNYCVQRITH ARINSERDGR RIADGEFTTA CAQACPASAI
TFGDLNDPNS QVAKLRAQQR NYGLLEDLNN RPRTTYMAVV RNPNPELEHA MERK