Gene Rcas_3046 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_3046 
Symbol 
ID5540542 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp3948870 
End bp3950357 
Gene Length1488 bp 
Protein Length495 aa 
Translation table11 
GC content66% 
IMG OID640895165 
Productheat shock protein DnaJ domain-containing protein 
Protein accessionYP_001433118 
Protein GI156742989 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0484] DnaJ-class molecular chaperone with C-terminal Zn finger domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.742448 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGCTTG AACAGACGTG TCCCTCATGC AGCGCGACGC TGGGTTCTGA TGGGATTTGC 
CCGGCATGTG GTTCGGTGAC GCGCGGCTTT TTTCGCAGTC TCAACCTGGG CGCGCCACAG
GTGGCGGCTG CTGTCGCGCA GGGGCTTGAC CTCTACCGGT TGCTCGGCGT CGATCAGCAC
GCCGATGCAA TCACGATTGC GCGCCAGTAT CGTCGGTTGC GCGCGCTCTT TCCCGATGAC
CCGTCTGCGC TGGCGGCAGA GCCGAGGCGC AAGTTCGAAT TGCTTCAGGT TGCCGGGCGC
ATCTTGACCG ATCCCTCGCT GCGGGCGCTC TATAACGAAC TGCGGGTATC GGCTGCCGCC
GGCATTCAGC AAGGCGTGGT GCGCTGTGAG TCGTGCGGCG CTCCGCTGCA AGGCAATGAG
CCGCGCTGCC GCTACTGTGG TTCGCTGCGT CCGGGAGAAC CGGCGCCGCC AGCCACGCCG
CCCGACGCCG GACCACCGGT TGCGGAGCCG GTTGATTTTT ATGCCCTGCT GGGACTCTCA
CCCGCGCATC TGATGATCAA TCCGGGTGCT CGTCGCTCGG CGCGTCCGGC GCTGGATGCC
GCAGAGATGT TGCACGAATC ACGACCGCCG ACGCCGGAAG AGGTCGATGC CGCTTCCTAC
GCCTTTCAGC AGCGGACGCT CCTCCATCCT GGTTGGTCTG CCGCAGAACG TGAGGCGCGG
GTAGAGAATC TGGAGATTGC CCGGCGCATT CTGCGGAACG AACGACTCCG CAATCGTTAC
GATGCCTTCT GGCTGGCATT CCGTCAGGGT CGGTTCGACC ACGGTCACCT CGAAGGGTTG
CGCGCGCTGA TCGATGAAGT GCGCGCAGAT GAAACGACGC CCTCGACACT TTCAGTGGAA
GAGGCGGAAG CGCTGTTTCA ACAGGGTCGC GGGTTGCTTA CTGCCGGATT GCCGCGCGAA
GCGCTCGATC CGTTGCGCCG GGCGCGTGAG GCGCTGCCGC ATTCCGCAGA GGCGCACGCA
TGGTATGCCC GCGCCATCCT TGCGTCTGCC GACCCGCTCG ATCTCGGCGG ACACGCGCTG
CGTCAGGCGC TGGTGGCTCT CGAAACCGCC GCTCGTTCTA GTGCGCCGCT TCCCGATAGT
GAGTCGTACC TTGCTCTGTG TCGCGGGTTG CTGGCGCGTG ACGCAGGAGA TGCGCGCCAG
GCGGAAATCG AATTGTTGCG CGCCGCACAA CAGAACCCCT CGCTGGCGCA CGCCTGGTGC
GGATTGGCGG CTCTTGCGCT GGCGCGCGGG GCGAACGGCG ATGCTATCGA CCACTGCCAT
CGGGCGCTGG CGATTGACCC GCGCGATGAG CGCGCCTGGT TGATGTTGGC AGGCGCCTGC
CTGCGCACTC GACGCCATGC CGAGGCGCGC GCGGCGGCAG AGCGGGTTGC CGCGTTGCGC
AGCGATGGTG TGAGCGCCGA GAAGATACTG TCCGAAATCG CAAACTGA
 
Protein sequence
MTLEQTCPSC SATLGSDGIC PACGSVTRGF FRSLNLGAPQ VAAAVAQGLD LYRLLGVDQH 
ADAITIARQY RRLRALFPDD PSALAAEPRR KFELLQVAGR ILTDPSLRAL YNELRVSAAA
GIQQGVVRCE SCGAPLQGNE PRCRYCGSLR PGEPAPPATP PDAGPPVAEP VDFYALLGLS
PAHLMINPGA RRSARPALDA AEMLHESRPP TPEEVDAASY AFQQRTLLHP GWSAAEREAR
VENLEIARRI LRNERLRNRY DAFWLAFRQG RFDHGHLEGL RALIDEVRAD ETTPSTLSVE
EAEALFQQGR GLLTAGLPRE ALDPLRRARE ALPHSAEAHA WYARAILASA DPLDLGGHAL
RQALVALETA ARSSAPLPDS ESYLALCRGL LARDAGDARQ AEIELLRAAQ QNPSLAHAWC
GLAALALARG ANGDAIDHCH RALAIDPRDE RAWLMLAGAC LRTRRHAEAR AAAERVAALR
SDGVSAEKIL SEIAN