Gene Cagg_1856 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_1856 
Symbol 
ID7266346 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp2275430 
End bp2276518 
Gene Length1089 bp 
Protein Length362 aa 
Translation table11 
GC content55% 
IMG OID643566692 
Productheat shock protein DnaJ domain protein 
Protein accessionYP_002463187 
Protein GI219848754 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0484] DnaJ-class molecular chaperone with C-terminal Zn finger domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGACT ATTACGAAAT TCTACAAGTT CATCCCAAGG CTGATGCAGA AGCGATTCGG 
GCCGCGTATG AACGACTACG CGAGCGTTAC GCTCCCGAAC GGCTAGAAGG TGCGGCTGAC
GAGTTAGTTG CTTTGGCACG CCAACGTCGT GATGAAATCG AACGGGCTTA CGCCATATTG
AGTGATCCAC AACGACGTGC CGAATATGAT CGGGAGCGAC AGGAGTCGGC GCACGCGGAA
CCGAAGACGA AACATACCCC CAGTTCGTCA GCACCGATTG TTGATGATGA TGAGCTTATC
GATTTTCGTC CGCTACCACC GGCGCGTCGG CAAGAACGGC CACCGGGCTT CAACCCACAA
CCGTATCTCT CACCTACCCA CAAGACATCA CCCGGTCGGG GACGTGCCGT GCAACGCCAG
TTGCCGGTCT GGGTCTTGCC TTCACTGGTC GTAGCCGCCG CAACGTTCAG TATTGTGCTC
GGTACACTGA TCAGTACGGC TGTGGTCGCG CCGACGACGC TCACGCAGCC AACGGCTACC
GTTGTTCCAC CAACGCCAAC CTTGCGTGAG GTGCTCGATC AGTTTGAAGG ACAGGTAATC
GCCGCACAAC AGGTGGTAGG ACAAGTGCCG GATAACCCCA ACGCATGGAT AAACTTGGGT
AATGCGCTGT TCGACAGTGT TGTGTTTGTC CGTGAACAAC TCGCTAACGG TGATGCAGAA
ACACAGCGTA TTTATCAGGA GCGCTTGCCG CGTTGGTTAG AAGCGGTAGA CGCTTACCGC
AAGGCACTAG AACTTCAACC GGGCAATGAT GTGGTACGGG CCGATATTGC TGCCAGCTTG
TGCTACTACG GCCTTGATAC TCGTGACCCA AAGCGAGCGC AGGAGGGGCT TGCCGAAGCC
GAGAAAGCAC TTGCCGCTGC CCCGCAAGAT GCGCGTGTAT TACTTTCACA CGGTATCTGT
CTGATCGCTG TTGATCCGCC ACAAGTCGAA CGAGCTATTC AACAGTGGCG GTTGGTATTG
TCAATCCCAG GAGTGAATCC GGGCCTGCAA TTACAAGCCC AGATTCTGAT CAACGAGTAT
AGTCAGTAA
 
Protein sequence
MTDYYEILQV HPKADAEAIR AAYERLRERY APERLEGAAD ELVALARQRR DEIERAYAIL 
SDPQRRAEYD RERQESAHAE PKTKHTPSSS APIVDDDELI DFRPLPPARR QERPPGFNPQ
PYLSPTHKTS PGRGRAVQRQ LPVWVLPSLV VAAATFSIVL GTLISTAVVA PTTLTQPTAT
VVPPTPTLRE VLDQFEGQVI AAQQVVGQVP DNPNAWINLG NALFDSVVFV REQLANGDAE
TQRIYQERLP RWLEAVDAYR KALELQPGND VVRADIAASL CYYGLDTRDP KRAQEGLAEA
EKALAAAPQD ARVLLSHGIC LIAVDPPQVE RAIQQWRLVL SIPGVNPGLQ LQAQILINEY
SQ