Gene Acid345_0979 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_0979 
Symbol 
ID4068646 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp1239056 
End bp1240261 
Gene Length1206 bp 
Protein Length401 aa 
Translation table11 
GC content61% 
IMG OID637982986 
Productchaperone DnaJ-like 
Protein accessionYP_590056 
Protein GI94968008 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0484] DnaJ-class molecular chaperone with C-terminal Zn finger domain 
TIGRFAM ID[TIGR02349] chaperone protein DnaJ 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00109475 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCAACAC AGACCAAAGA TTATTACGGC GCGTTAGGGG TTAAGAAGAA TGCTTCGGCG 
GAGGAGATCC GCAAGGCGTT CCGCAAACTT GCGCGCAAAT ATCACCCGGA CGTGAACCCC
GGGGACAAGA AGGCCGAAGA CAAGTTCAAG GAAATCTCCG AGGCGAACGA AGTCCTGAGC
GATCCAAAGA AGCGGAAGAT TTACGACCAG CTCGGGTTCT ATTCGGACAA CATTGATCCG
GCTGCGGCCG AGGCGTATGC GCGTAACGGC GCGACGGGCG CGGGTGGATT TGGCGGTTAC
GATCCGCGGG CCGCGCAGGG CGGGCAGGAC ATTCCGTTCG ACTTCAGTGG ATTTGATTTT
TCGCAGGAAG CGGGCGAACC GTCCGGCGGC GGCGGCTTCC GCGATATCTT CTCTTCCCTA
TTTGGCGGAG GGCGCGGTGG ACACGAGGAG CGTCCGCGGC CGCAGGCAGG AACCGACCTT
GAGTACCAGG TGAATGTGCC CTTCTGGGAT GCGATCCGCG GGACGACGGT GAAGCTGAAC
ATCCAGCGGC GCGAAGTTTG TTCGAACTGC CATGGCGAAG GCGAAATCGG CGGGACGCAT
ACGTGTCCGC AATGCCATGG CAAGGGCAAG ATCGAGACGG GCGGCGGGCC GATGAAGTTC
AACGTCACGT GTCCGACGTG CCACGGCACC GGCAAGGCGC GAACCCAGTG CCCGGTGTGC
CATGGCGAAG GCGCGATCAC GCGCAACGAG CCGCTGGAAT TCAAGATCAA GGCCGGTACG
CGCGATGGTC AGCGCATTCG TCTTGCGGGC AGAGGGAATG CCGGCACGAT GGGTGGCGCA
AGCGGCGATC TGTACATCAT TGTGAAGGCC GGGGAGCATC CGGTATTCAG GCGCGAGGGC
GATGACGTCT ACGTGACGGT GCCGGTGTCG GCGGTGGAAG CGGCGCTAGG AACGAAGATC
GAAGTGCCGA CGATTGATGG ACGCGCACTG CTGAAGATTC CGCCGGGAAC GAACAGCGGA
CAGAAGCTGC GGCTGCGCGA AAAAGGCGTT CCGAACGCTG CCGACGGAAC GAAGCGCGGC
GATGAGATTG TCGAGGTGAA GCTCATCGTG CCGAAGGTGA GCGATGAGCG CTCGAAGGAG
ATTCTGCGCG AGTTGCAGAA GCTGAATCCG GAGGATCCGC GGGAAGAGTT GTGGAGACAG
GTGTAA
 
Protein sequence
MATQTKDYYG ALGVKKNASA EEIRKAFRKL ARKYHPDVNP GDKKAEDKFK EISEANEVLS 
DPKKRKIYDQ LGFYSDNIDP AAAEAYARNG ATGAGGFGGY DPRAAQGGQD IPFDFSGFDF
SQEAGEPSGG GGFRDIFSSL FGGGRGGHEE RPRPQAGTDL EYQVNVPFWD AIRGTTVKLN
IQRREVCSNC HGEGEIGGTH TCPQCHGKGK IETGGGPMKF NVTCPTCHGT GKARTQCPVC
HGEGAITRNE PLEFKIKAGT RDGQRIRLAG RGNAGTMGGA SGDLYIIVKA GEHPVFRREG
DDVYVTVPVS AVEAALGTKI EVPTIDGRAL LKIPPGTNSG QKLRLREKGV PNAADGTKRG
DEIVEVKLIV PKVSDERSKE ILRELQKLNP EDPREELWRQ V