Gene Acid345_4240 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_4240 
Symbol 
ID4073167 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp5027197 
End bp5028888 
Gene Length1692 bp 
Protein Length563 aa 
Translation table11 
GC content60% 
IMG OID637986272 
Productnickel-dependent hydrogenase, large subunit 
Protein accessionYP_593314 
Protein GI94971266 
COG category[C] Energy production and conversion 
COG ID[COG0374] Ni,Fe-hydrogenase I large subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAAAAC GAATCACCAT CGATCCGATC ACCAGAATTG AAGGCCACCT CCGCGTAGAT 
GTCCAAGTGG ACAACAACTC GGTCACAAAC GCCTGGGCCT CGTGCACTAT GTGGCGCGGC
ATTGAAAACA TCCTCAAGGG CCGTGACCCC CGCGATGCAT GGCTCTTCAC CCAGCGTTTC
TGCGGTGTGT GCACCACCGT GCACGCGATG GCCAGCGTCC GCGCGGTAGA GGATGCGTTG
AAGCTGGAGA TCCCGCTAAA CGCGCAATAC ATCCGCAACC TCATTCTGAT TGCCCACGCG
CTGCACGATC ACATCGTGCA TTTCTACCAG CTCTCGGCTC TCGACTGGGT TGATGTCATG
CAGATTCCGA AAGCCGATCC TGCCGCGACT TCAAAGCTCG CCGAGAGCCT TTCGCCATGG
TTCCGCAACT CGCGCAACGA ACTCAAGCAG GCGCAGGACC GCGTGAACGC TGTTGCTGCC
AGCGGCCAGC TCGGCATCTT CGCCAACGGC TACTGGGGAC ACCCAGCGAT GCGCCTCTCG
CCCGAGGTGA ATCTGCTTGC CTTCTCGCAC TACATGCAGG CGCTGGAGTA TCAGCGCAAA
GCGCTGCAGA TCGTCGGCAT CCTCGGCTCA AAGACACCGC ACATCCAGAA CCTCACGCCC
GGTGGTGTCT CGAACGCGAT TGATCTCGAT AGTCAGTCGG CGCTGAACAT GGAGCGCCTT
GAGATGATCC GCGGCCTCTT TGCAGAGGTC TCGCGTTTCA TCAACGAGGT TTACCTCGTG
GATGTCTGCG CTGTAGCTTC GATGTACCCC GAGTGGTTCA ATATCGGCAG CGGCGTCACC
AATTACCTCG CCGTTCCGGA CTTGCCGCTC GACAGCCGCG GTTCCAGCTA CGATCTTCCG
GGCGGCTACA TCGGTGCAGG AGGACTGAAA TCGTTCCAGA CTGCTTCTGA CGACGCCTTC
CGCAAAGGCG TGACCGAAGA CGTAACCCAC GCCTACTACT CGGGCGATAA ACCGCTTCAT
CCCTGGGAAG GCGAGACTAA CCCGCAGTTC ACCGGCTGGA ACGGTGACGA GAAGTACTCC
TGGGTGAAGG CGCCACGCTT CAACGGCGAT CCTGCGCAGG TCGGTCCACT GGCACAGGTG
CTGATCGGTT ACACCCAAGG TCACGCGCTC ACCAAGAAGT ATGTCGGCCT AGCTGCGGAG
AAGGTTCATG CCGTCAGCGG CATCCAACTG CAACCGGCAA TGCTCCACTC CACTCTCGGC
CGCCACGCCG CGCGCGCCAT CCGCGCCGGC ATGCTCGCCG AGTTGGCGCA AAAGCATCTT
GACCTGCTCA CCAACAACAT CGCAAAGGGT GACTACTCCG TCTACAACGC ACCGGTCTTC
CCCAGCCACG AAGTAGAAGG TGTCGGCACC CACGAAGCTC CGCGCGGTAC GCTCTCGCAC
TGGATTGTGA TCAAAGACGA GAAGATCAAG AATTACCAGG CCGTCGTTCC TTCGACCTGG
AACGCCAGCC CGCGCGACCA AAAGAACGCG CATGGCCCGT ACGAGGCATC GCTACTGCAC
ACGCCGCTAG CGCGCCCGCA AGAGCCACTT GAGGTCTTGC GCACCATTCA CTCGTTCGAT
CCGTGCATGG CTTGTGCCTG CCACACCTTC GATCCATCCG GAAACAAGAT CGCAGCGGTC
AATATTTTAT GA
 
Protein sequence
MAKRITIDPI TRIEGHLRVD VQVDNNSVTN AWASCTMWRG IENILKGRDP RDAWLFTQRF 
CGVCTTVHAM ASVRAVEDAL KLEIPLNAQY IRNLILIAHA LHDHIVHFYQ LSALDWVDVM
QIPKADPAAT SKLAESLSPW FRNSRNELKQ AQDRVNAVAA SGQLGIFANG YWGHPAMRLS
PEVNLLAFSH YMQALEYQRK ALQIVGILGS KTPHIQNLTP GGVSNAIDLD SQSALNMERL
EMIRGLFAEV SRFINEVYLV DVCAVASMYP EWFNIGSGVT NYLAVPDLPL DSRGSSYDLP
GGYIGAGGLK SFQTASDDAF RKGVTEDVTH AYYSGDKPLH PWEGETNPQF TGWNGDEKYS
WVKAPRFNGD PAQVGPLAQV LIGYTQGHAL TKKYVGLAAE KVHAVSGIQL QPAMLHSTLG
RHAARAIRAG MLAELAQKHL DLLTNNIAKG DYSVYNAPVF PSHEVEGVGT HEAPRGTLSH
WIVIKDEKIK NYQAVVPSTW NASPRDQKNA HGPYEASLLH TPLARPQEPL EVLRTIHSFD
PCMACACHTF DPSGNKIAAV NIL