Gene Acid345_1705 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_1705 
Symbol 
ID4070488 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp2069604 
End bp2070719 
Gene Length1116 bp 
Protein Length371 aa 
Translation table11 
GC content61% 
IMG OID637983713 
Producthydrogenase expression/formation protein HypD 
Protein accessionYP_590780 
Protein GI94968732 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0409] Hydrogenase maturation factor 
TIGRFAM ID[TIGR00075] hydrogenase expression/formation protein HypD 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.340264 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGTTTG TCGATGAGTA CCGCGACCGC GAAAAAGCTG AGCAATACGC ACGCGCCATT 
CGAACCGAAG CCACGCGCCC CTGGTCCATC ATGGAAGTCT GCGGCGGTCA GACCCACACC
ATCGTCAAGT ACGCGATTGA CGAAATCCTG CGCGACAAGA TCACGCTCCT CCATGGTCCC
GGCTGCCCAG TGTGCGTCAC GCCACTCGAG TTGATCGACA AAGCTTGCGA GATTGCGACG
CGGCCCGACG TCATCTTTTG TTCGTATGGC GACATGCTTC GAGTGCCCGG ATCGCACACA
GATTTGTTCA CCGTGAAAGC CAGGGGCGGC GATGTGCGCA TCGTGTACTC GCCGATGAAC
GCACTAGAGC TTGCGAAAGC GAACCCAACG AAGCAGGTTG TCTTCTTCGC GGTTGGCTTC
GAAACCACCG CGCCCGCCAA TGCCATGGCC GTATTCCAAG CCAAGCAGCA AGGCATCTCG
AACTTCTCGG TGCTCGTCTC GCACGTGCTT GTGCCGCCCG CGATTGAGGC CGTGCTGAGC
GCACCCGACA ACCGCACGCA AGCGTTTCTG GCTGCCGGAC ACGTCTGCAC TGTGATGGGA
TATGAAGAGT ACCGGCCGCT ATCAGAGAAA TATCGCGTGC CGATCGTCGT CACCGGCTTT
GAGCCACTCG ACATTCTCCA GGGCGTGTTG ATGTGCGTAC GGCAGCTTGA GAACGGTCGC
GCTGAGGTTG AGAACCAGTA CGCGCGCTCG GTGCGCGAGT TTGGCAATGT TCCGGCGCAG
GATCTCATCG GCCAGGTCTT TCGTGTGATT CCGCGCAAGT GGCGCGGCGT TGGCGAGATC
CCGCAGAGCG GCTTCGGTCT CGCAGCGGAG TTCGCCGAAT ACGACGCGGA GTTGCGCTTC
GGTGTCGCGG ACCTCACGGT AGAAGAGGAC CGCGAGTGCA TCGCCGGAGA GGTTTTGCGT
GGTGTGAAGA AGCCGCAGGA GTGTCCGGCG TTCGGCGGGC GCTGCACGCC AGATCATCCA
CTGGGAGCGA CAATGGTCTC GAATGAAGGC GCCTGTGCTG CGTACTACCA ATACCGGCGG
CACGAAGCTA AGGCCGCGGT CGGGAGCGAA CGATGA
 
Protein sequence
MKFVDEYRDR EKAEQYARAI RTEATRPWSI MEVCGGQTHT IVKYAIDEIL RDKITLLHGP 
GCPVCVTPLE LIDKACEIAT RPDVIFCSYG DMLRVPGSHT DLFTVKARGG DVRIVYSPMN
ALELAKANPT KQVVFFAVGF ETTAPANAMA VFQAKQQGIS NFSVLVSHVL VPPAIEAVLS
APDNRTQAFL AAGHVCTVMG YEEYRPLSEK YRVPIVVTGF EPLDILQGVL MCVRQLENGR
AEVENQYARS VREFGNVPAQ DLIGQVFRVI PRKWRGVGEI PQSGFGLAAE FAEYDAELRF
GVADLTVEED RECIAGEVLR GVKKPQECPA FGGRCTPDHP LGATMVSNEG ACAAYYQYRR
HEAKAAVGSE R