Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_3003 |
Symbol | |
ID | 4071558 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 3555657 |
End bp | 3558701 |
Gene Length | 3045 bp |
Protein Length | 1014 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 637985022 |
Product | Fe-S-cluster-containing hydrogenase |
Protein accession | YP_592078 |
Protein GI | 94970030 |
COG category | [C] Energy production and conversion |
COG ID | [COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing [COG0437] Fe-S-cluster-containing hydrogenase components 1 |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 2 |
Fosmid unclonability p-value | 0.00213363 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGATAACG GATCAAAGAA GAACGGCGCG GACGTTTGCC CCAGCAAGAA GGGCAAGCTC GAACTCGCCG ACGTGAAACA GCAGTTGGCG GCGGCCAAGG ACGGCCCGCA ATATTGGCGC AGCCTCGATG AACTCTCCAA TACGGATGAG TTCCAGGAAA TGCTGCACCG CGAATTTCCG CGGCAGGCCT CGGAGTGGGT AGACGACGGT GGCAGTTCCC GCCGCGACTT CCTCAAGCTG ATGAGCGCTT CGTTGGCGCT CGCCGGACTT ACCGCCTGTA CCAAGCAGCC GATCGAGCCG ATCGTTCCTT ACGTTCGCCA GCCCGAAGAA TTGACCCTTG GCAAGCCCCT CTTCTTCGCA ACCGCGAACA CCGTCGGCGG CTACGCCGTG CCGGTTCTCG CGGAAAGCCA TGAAGGTCGG CCAACTAAGC TGGAAGGGAA CCCGCAGCAC CCCGCGACGC TCGGCGGTAC CGATGTCTTT ACTCAGGCCT CGGTTCTCAC CATGTACGAT CCCGACCGCT CGCAGGTCGT AATGCTCGAT AACGAGATCC GCACCTGGGG CTCGTTTGTC GGTGCCGTTG CGAATCCGCT GGCCGCGCAG AAGGCCGTGC AGGGCGCTGG ACTTCGACTT CTCACTCGCT CGACCACATC GCCAACGCTT GGCGCGCAGA TCAAGCAGCT TCTGCAGACT TATCCGCAGG CAAAGTTGGT GCAGTACGAC CCGGCGGGTC GCGACAACGC TCGCGCTGGT TCGCAACTTG CCTTCGGTCA GTACGTCGAG ACGCAGTACA ACCTCGACAA GGCCGACATA ATTCTTTCGC TCGATGGCGA TTTCCTCTCC AGCGGATTCC CCGGCTTTCA CAAGTACGCC CGCAACTTCT CGCAGCGCCG CCAGCCCGAC CTCAAAGAGA AAATGGTTCG GTTCTACATG GCGGAGAGCA CGCCGACCAA CACCGGCGGC AAGGCCGATC ACCGCATCCC GATGCGCGCC TCCGATGTCG AACAATTCGG ACGTGCCATC GCCGCGGGCA TCGGAGTAGC TGGCGCTGGT GGTTCAGCAA AGCAGGAGTG GCAAAACCAG GTTGCCGCAA TAGTCTCGGA TCTCAACAAG CACAAGGGCG CCGCCGTCGT CGTGGTCGGT GAGCATCAAC CACCCGCGGT TCATGCTCTC GCGCACTCCA TGAATGCCGC TCTCGGCGCG GTTGGCACGA CCGTTACGTA TACCGAGCCG ATCGAACAGA TTCCCGCGGA TCAAACTGCC GGCCTCAAGG AACTCGTCGC CGACATGAAC TCCGGCAAGG TGGACTTGCT GGTCGTCATG GGCGCGAACC CGGTATACGA AGCCCCCGCC GACCTCGCCT TCCTCGACGC CTTTAAGAAA GTCGCGGTCC GCATCCATCA CGGCCTCTAC GTCGACGAAA CCGCGGTCTT GTCGCACTGG CACATCAACG GTACGCACTT CCTCGAGCAG TGGGGCGATG TTCGTGCCTT CGACGGCACC GTCACCATCC AGCAACCGCT GATTGCTCCG CTCTACAACG GCAAGAGCCA GTACGAATTC GTCGCCGCGC TCAACGGGCA AGGTTCCACC AGCGGCTATG AACTGGTAAA GGGCACGTGG CAGAAGCAGC ACACGGGCGC CGATTTCGAA GCCTGGTGGC GCAAGGCTGT GCACGATGGC CTCATCGCCG GCACCGCCGC ACCCGCAAAA ACTGTCAGCG CGAAGGGCGC TCCCGCCGCG ACGAACGCCG CCAGCGACAG CGCGATGGAG CTCATCTTCC GCCGCGATCC CATGATTTAC GACGGCGAAT ACTCCAACAA CGGCTGGCTC CAGGAAGCTC CGAAGCCGAT CACGCAGCTC ACTTGGGACA ATCCCATCGA GATGAACGTG ACCCAGGCGG AGCAGATGGG AATCAAGACC GAGGACGAAC TCGAGATCAC CGTCGATGGC CGCAAGATCG TTGGCGGCGC TTGGCTCACG CCCGGTCACC CTAAGAATTC AGTCACTGTC TTCCTGGGCT ATGGCCGAAC GCGCGCTGGC CGAGTGGGCA CTGGCACAGG GTACAACGCC TATCAGGCCC GCACCTCCGA CAAACAGTGG ATCGTGAATG GCGTCCAGAT CGCGAAGACC GGCAAGAAGT TCCTCTTCGC CACCACGCAA GGCTGGCAAA ACATGGATGG CCGCGACCTG GTTCGCGTCG CCACCCTCGA AGACTTCATT GCCAATCCCG AGTTCGCGCA CGAAAAGACG GAAGCTCCAG TCGAAGGGCT CACCATCTTC CAGCCCTACG ACTACAGCGA AAAGCCGGGT GAGACTCGCT ACAAGTGGGG CATGGCGATT GATCTCAACT CCTGCATTGG TTGCAAGAGC TGCGTCGTCG CTTGCGTCTC TGAGAACAAC ATCCCGGTCG TTGGCAAGGA ACTCGTTAAA CGCGGCCGCC ACATGCACTG GCTCCGCGTC GACAACTATC ACGAGGGCTC GCCCGACGAT CCCAAGACCT ACTACCAGCC GGTGCCTTGC CAGCAATGCG AGAACGCGCC CTGCGAGTTG GTCTGCCCGG TCGGCGCCAC CGTTCACAGC AGTGAAGGCC TGAACGACAT GGTCTACAAC CGCTGCGTGG GCACGCGTTA TTGTTCGAAC AATTGCCCAT ACAAGGTGCG TCGCTTCAAC TTCCTGCTTT ATCAAGATTG GGAAACGCCA CAGTACAAGA TGATGCGCAA TCCGGATGTC TCGGTGCGCA GCCGTGGCGT GATGGAGAAG TGCAACTACT GCGTGCAGCG CATTACGCAC GCCCGCATCA ACTCTGAGCG CGATGGGCGC CGCATTGCGG ATGGCGAATT CACCACCGCG TGCGCGCAGG CGTGCCCGGC GAGCGCTATC ACCTTCGGCG ATCTCAACGA TCCCAATAGC CAGGTAGCCA AGCTTCGCGC GCAGCAGCGC AATTACGGAT TGCTGGAAGA CTTGAACAAC CGTCCGCGCA CCACATATAT GGCGGTGGTC CGCAACCCGA ACCCTGAACT CGAGCATGCC ATGGAGCGGA AGTAA
|
Protein sequence | MDNGSKKNGA DVCPSKKGKL ELADVKQQLA AAKDGPQYWR SLDELSNTDE FQEMLHREFP RQASEWVDDG GSSRRDFLKL MSASLALAGL TACTKQPIEP IVPYVRQPEE LTLGKPLFFA TANTVGGYAV PVLAESHEGR PTKLEGNPQH PATLGGTDVF TQASVLTMYD PDRSQVVMLD NEIRTWGSFV GAVANPLAAQ KAVQGAGLRL LTRSTTSPTL GAQIKQLLQT YPQAKLVQYD PAGRDNARAG SQLAFGQYVE TQYNLDKADI ILSLDGDFLS SGFPGFHKYA RNFSQRRQPD LKEKMVRFYM AESTPTNTGG KADHRIPMRA SDVEQFGRAI AAGIGVAGAG GSAKQEWQNQ VAAIVSDLNK HKGAAVVVVG EHQPPAVHAL AHSMNAALGA VGTTVTYTEP IEQIPADQTA GLKELVADMN SGKVDLLVVM GANPVYEAPA DLAFLDAFKK VAVRIHHGLY VDETAVLSHW HINGTHFLEQ WGDVRAFDGT VTIQQPLIAP LYNGKSQYEF VAALNGQGST SGYELVKGTW QKQHTGADFE AWWRKAVHDG LIAGTAAPAK TVSAKGAPAA TNAASDSAME LIFRRDPMIY DGEYSNNGWL QEAPKPITQL TWDNPIEMNV TQAEQMGIKT EDELEITVDG RKIVGGAWLT PGHPKNSVTV FLGYGRTRAG RVGTGTGYNA YQARTSDKQW IVNGVQIAKT GKKFLFATTQ GWQNMDGRDL VRVATLEDFI ANPEFAHEKT EAPVEGLTIF QPYDYSEKPG ETRYKWGMAI DLNSCIGCKS CVVACVSENN IPVVGKELVK RGRHMHWLRV DNYHEGSPDD PKTYYQPVPC QQCENAPCEL VCPVGATVHS SEGLNDMVYN RCVGTRYCSN NCPYKVRRFN FLLYQDWETP QYKMMRNPDV SVRSRGVMEK CNYCVQRITH ARINSERDGR RIADGEFTTA CAQACPASAI TFGDLNDPNS QVAKLRAQQR NYGLLEDLNN RPRTTYMAVV RNPNPELEHA MERK
|
| |