Gene Rcas_4441 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_4441 
Symbol 
ID5541954 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp5710033 
End bp5710998 
Gene Length966 bp 
Protein Length321 aa 
Translation table11 
GC content63% 
IMG OID640896539 
Productheat shock protein DnaJ domain-containing protein 
Protein accessionYP_001434475 
Protein GI156744346 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0484] DnaJ-class molecular chaperone with C-terminal Zn finger domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.342346 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAACGC AGGATTTCTA CGACATCTTG CAGGTTGCGC CCGACGCTGA TGAAGAGGCG 
ATTTGCGCCG CTTATCAGCG TCTGCGCGAA CAGTATGATC CGCAAAAGTT GAACGGCGCC
GCTGCGGAAC TGGTCGAACT AGCGCAGCAG CGCCTGTCAC GCATCGACGA GGCATACGCG
ACGCTCTCTG ATGCGCAGCG CCGCGCACAG TACGATGCGC AGCGTCAGGC GTCTCTCCAG
GACGTGCCCG ATTACCGTCC GCTTCCCCCA GCACAGCACG CAGAACGCCC CCGTGATTTC
AACCCCCGTC CGACCATCAA CCAGCCAGCG GCGGCGGCAA TTGCGGGTCC GGCAGCAGCG
GTGATTGCGG TGCTGGCGAT TGCGCTGGTA TCGATCATTG GCGGATTAAT CTTGACCGGT
GGCGGAAGTG TGCCGCAAGC GGTCCCTACT CCCACAACTT CGCCGATGGA CGCGCTGGAG
ACCATGATCG CCCGCGCCCG TCAGATTGCT GAACAGAACG AGAACGATGC GCAGGCGTGG
TTGGACTATG CCAACCTCCT CTACGACAGT GTCCAGATTG TGCGCGAACA GGCGCCCAAT
AGCGTGCTGT ATCAGCAACG CCTGCCGCGC TGGCTCGAAG CGGCAAAGGC TTATGAGCGC
GTCCTCGAAC TCGATCCGAC CAACGCAGTC GCGCGCGGCG ACCTCGGCGC CTCCCGCTGT
TTCTATGGCG CCGGCGTGGG GGATCAGACG TTTGTGGTGG AGGGATTGAA GGACCTGGAG
ACGGCCACCG CAGCACGCCC CGAAGATACG CGCCTGCTGC TCAATCTTGG CTCGTGCCTG
GCATCGGCCC AACCGCCGCG CACCGACGAA GCCATCGAGG TCTGGCAGCG CATTATCTCA
ATTGCGCCAA CCGGATCGCC CGTCGCCAAC GAAGCGCAGC GTCTGATCGA TCAGGTGCGC
AGGTAG
 
Protein sequence
MATQDFYDIL QVAPDADEEA ICAAYQRLRE QYDPQKLNGA AAELVELAQQ RLSRIDEAYA 
TLSDAQRRAQ YDAQRQASLQ DVPDYRPLPP AQHAERPRDF NPRPTINQPA AAAIAGPAAA
VIAVLAIALV SIIGGLILTG GGSVPQAVPT PTTSPMDALE TMIARARQIA EQNENDAQAW
LDYANLLYDS VQIVREQAPN SVLYQQRLPR WLEAAKAYER VLELDPTNAV ARGDLGASRC
FYGAGVGDQT FVVEGLKDLE TATAARPEDT RLLLNLGSCL ASAQPPRTDE AIEVWQRIIS
IAPTGSPVAN EAQRLIDQVR R