Gene Acid345_1704 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_1704 
Symbol 
ID4070487 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp2068555 
End bp2069607 
Gene Length1053 bp 
Protein Length350 aa 
Translation table11 
GC content61% 
IMG OID637983712 
Producthydrogenase expression/formation protein HypE 
Protein accessionYP_590779 
Protein GI94968731 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0309] Hydrogenase maturation factor 
TIGRFAM ID[TIGR02124] hydrogenase expression/formation protein HypE 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.337224 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGTGA AATCGTTTTC CCTCAACTGT CCCCTGCCAC TAAATACCTC GGCCGTTGTG 
CTACTGGCGC ATGGAGGCGG TGGCCGCCTG ATGCACCAAT TCCTTGAGCA ATACGTCATT
CCAGCGTTCC AGAACGACCA GCTCGCGACG CGGCACGACG GCGCAGTCTT CGACGTGAAA
AACGCGCGGC TCGCCTTCAC GACGGACTCC TACGTTGTTA GCCCACTGTT TTTTCCCGGT
GCGGACATCG GCTCCCTGGC CGTGAGCGGC ACCGTGAACG ATCTCGCAAT GTGCGGTGCC
CGGCCGCTGT TCCTGAGCGC AGCATTCATC ATGGAAGAAG GACTTCCCTT CGAAACGCTG
GAGCGCGTGA TTGCTTCCAT GCGCGCGACC GCTGAGGCCG CGGGCGTGCA TATCGTCACC
GGCGATACCA AGGTTGTGGA CCGCGGCAAG TGCGACAAAC TCTTCATCAA CACCGCCGGC
GTCGGCGTCA TTGAGCATAC GCAAACGATC GGACCGAGCG CGGTCCGTTC GGGCGACGCG
ATCCTTCTCA GTGGCGACGT TGGCCGACAC GGCGTAGCCG TGATGGCTAC ACGAGAAGGA
CTGGAGTTCG CGACGGCGAT TGAATCTGAC TGCGCTCCGC TCGCAGAGCC AGTGCTTGCG
CTACTCGAAG CCGGCATCGA AGTTCATTGC TTCCGCGATC TGACGCGGGG CGGATTGGCC
ACCACGCTGG TGGAGATTGC CGAATCGAGC AAGCTTGGAA TTCATATCGC TGAACGCAGC
GTTCCCGTGG AAGAAGGCGT GCGCGGAGCG TGCGAAATTC TGGGCCTGGA TCCACTCTAC
CTGGCGAATG AAGGCAGATT CGTTGGGTTC GTTGCTGAAA GAGATGCGGA AAATGCGCTG
GATATCATGC GGAAGCACGC CGTGTCGGCG CAGGCCAAGC GGATCGGAAC CGTGGCCGAG
GCGCCCGCCT CGCTGGTCAC GCTCGAAAAT CTGCTTGGGA CAAAGCGCAT CCTCGACATG
GAACGCGGCG AACCTCTTCC ACGGATTTGC TGA
 
Protein sequence
MSVKSFSLNC PLPLNTSAVV LLAHGGGGRL MHQFLEQYVI PAFQNDQLAT RHDGAVFDVK 
NARLAFTTDS YVVSPLFFPG ADIGSLAVSG TVNDLAMCGA RPLFLSAAFI MEEGLPFETL
ERVIASMRAT AEAAGVHIVT GDTKVVDRGK CDKLFINTAG VGVIEHTQTI GPSAVRSGDA
ILLSGDVGRH GVAVMATREG LEFATAIESD CAPLAEPVLA LLEAGIEVHC FRDLTRGGLA
TTLVEIAESS KLGIHIAERS VPVEEGVRGA CEILGLDPLY LANEGRFVGF VAERDAENAL
DIMRKHAVSA QAKRIGTVAE APASLVTLEN LLGTKRILDM ERGEPLPRIC