Gene Acid345_0869 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_0869 
Symbol 
ID4068962 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp1085392 
End bp1086282 
Gene Length891 bp 
Protein Length296 aa 
Translation table11 
GC content54% 
IMG OID637982876 
Productformate dehydrogenase beta subunit 
Protein accessionYP_589946 
Protein GI94967898 
COG category[C] Energy production and conversion 
COG ID[COG0437] Fe-S-cluster-containing hydrogenase components 1 
TIGRFAM ID[TIGR01582] formate dehydrogenase, beta subunit, Fe-S containing 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAAGCA AGCTTCTCCA GATCAAGGCG ATCTCGGGCC ACGCAGGAGT GGCGCCTGGC 
GCGAACATGT CGCGCGACTA TGACGTGTGC AAGCTCGTCG ACACAACCAC GTGTATCGGC
TGCAAGGCGT GCGAGGTCGC GTGTCTCGAG TGGAACGGAT ACGACTTCCA GCCAACAACG
TTCGACAATA CATACCAGAC GATGCCAGAC ACGTCATGGA ACTACTGGAA CTTGATCCGC
TTCGATGAGC ACGTGAACGA GGATGGAAGC TTTAGCTGGT TGATGCGCAA GGACCAATGC
ATGCATTGCG AGGACCCAGG ATGCCTGGCA GCGTGCCCCG CCGACGGCGC CATCGTGCAG
TATGAGAATG GCATCGTTGA TTTCAACCAG GCGAACTGCA TCGGTTGTCA GTATTGCGTT
ACCGGATGTC CCTTCAATAT TCCGAAATTC AATCCCACAA CGAAAAAAGT CTTCAAATGT
ACGCTGTGTT CCGATCGCGT TGGCGCGGGA CTCGAGCCTG CATGCATCAA GGCATGTCCG
ACCGGGTGCC TGCACTTCGG TTCTAAAGAA GACATGAAAG ATCTTGCGAA CAAGCGCGCC
ACGCAGCTCC GAGAGCACAC AGCTCATCAG AATGCGGGTG TGTACGATCC TGAAGGAGTT
GGGGGAACGC ACGTTATCTA TGTGCTTCAT GACATCAATA ATCCCGAGAA GTACGGTGGC
CTGCCGAAAA ACCCGACGGT AAATCCGATG GTTCGACTTT GGAAGGGACC ATTGAAATGG
ATTGGAGGGC TGGGAATGAT ATTCGGCGCC GTCGGTATTG CGTTCCATTA CTTGAGGTAC
GGGCCGAAGG AAGCTGAAAT TCATCCGGGA GGCGATCGTG AGCGAAATTA G
 
Protein sequence
MASKLLQIKA ISGHAGVAPG ANMSRDYDVC KLVDTTTCIG CKACEVACLE WNGYDFQPTT 
FDNTYQTMPD TSWNYWNLIR FDEHVNEDGS FSWLMRKDQC MHCEDPGCLA ACPADGAIVQ
YENGIVDFNQ ANCIGCQYCV TGCPFNIPKF NPTTKKVFKC TLCSDRVGAG LEPACIKACP
TGCLHFGSKE DMKDLANKRA TQLREHTAHQ NAGVYDPEGV GGTHVIYVLH DINNPEKYGG
LPKNPTVNPM VRLWKGPLKW IGGLGMIFGA VGIAFHYLRY GPKEAEIHPG GDRERN