Gene Rcas_3734 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_3734 
Symbol 
ID5541236 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp4899496 
End bp4901364 
Gene Length1869 bp 
Protein Length622 aa 
Translation table11 
GC content61% 
IMG OID640895845 
Productchaperone protein DnaK 
Protein accessionYP_001433792 
Protein GI156743663 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0443] Molecular chaperone 
TIGRFAM ID[TIGR02350] chaperone protein DnaK 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000000103994 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTCGAAGA TCGTAGGCAT CGACCTGGGC ACGACCAACT CGGTCGTTGC GGTGATGGAA 
GGCGGCGATC CGGTCGTCAT TCCGAACGCC GAAGGCAACC GCACCACGCC GTCGGTTGTG
GCGTTCACCA AAAACGGCGA GCGCCTGGTT GGGTTGACGG CGAAGCGTCA GGCGACGATC
AATCCTGAGA ATACATTCTA TTCGATCAAG CGTTTTATCG GGCGCAATTT CGACGAAACC
ACCGTCGAGC GCGAAATGGT GCCGTTCAAA GTCGTGAAAG GACCGCGCAA CGACGTGCGC
GTCTATGCGC CGACCACCGG CAAAGAGTAT GCGCCGCAGG AAATTTCGGC GATGGTGTTG
CAGAAACTCA AGACCGACGC CGAAGCCTAC CTGGGTGAGC CGGTGACAAA GGCGGTCATT
ACGGTTCCGG CATACTTCAA CGACAGTCAG CGCCAGGCGA CGAAAGACGC CGGCAAGATC
GCGGGGCTTG AGGTGCTGCG CATCATCAAC GAGCCAACTG CCGCCGCGCT GGCGTATGGT
CTCGATAAGA AGAAGGACGA GACGATCCTG GTGTTCGATC TTGGCGGCGG CACGTTCGAC
GTGTCGGTGC TCGAAGTCGG CGATGGCGTG GTCGAAGTCA AAGCCACCAA CGGCGACACA
CACCTCGGCG GCGACGACTA CGACCAGCGG ATCGTCAACT GGTTGATCGA CGAGTTCCGC
AAGGATCAGG GGATCGACCT GAGCAAGGAT CGCCAGGCGT TGCAGCGCCT GAAGGAAGCC
GCTGAAAAGG CGAAGATCGA ACTGTCGAGC ATGTCGGAGA CGGAGATCAA CCTGCCGTTT
ATCACGGCGG ACGCCAGCGG TCCGAAGCAT TTGCAAATGC GCCTCAGCCG CGCAAAGTTC
GAGCAGTTGA CTGCCGACCT GACCGAGCGC CTGAAGGGTC CGTTCTTCCA GGCGTTGAAG
GACGCCGGTC TAAAGCCGAA CGACCTCGAT GAAGTCGTGC TGGTGGGTGG TTCGACCCGT
ATGCCGGTGG TCATCGACCT GGTGCGCAAA CTGACGGGCA AGGAGCCGAA CCGCAGCGTC
AACCCGGACG AGGTTGTGGC GATTGGCGCA GCGATCCAGG CAGGTGTGCT CGGCGGCGAC
GTGAAGGACG TGGTGTTGCT CGACGTGACG CCGCTCTCGC TCGGCGTCGA GACGCTCGGC
GGCGTGATGA CGAAACTGAT CGAGCGCAAC ACGACCATTC CGACCCGCAA GAGTGAGATC
TTCTCGACGG CTGCTGACGG GCAGACGGCG GTGGACATCC ACATCTTGCA GGGCGAGCGC
GAAATGGCCG CCGACAATAT GACGCTGGGG CGCTTCCGGC TCGAAGGCAT TCCGCCTGCG
CCGCGCGGCG TGCCGCAGAT TGAAGTGACC TTCGACATCG ACGCCAACGG CATCCTGAAC
GTGTCCGCCC GTGATAAGGC GACCGGCAAG GAGCAGCGGA TCACGATTAC GGCCAGCACG
AACCTGTCGA AGGAAGAGAT CGAGCGGATG GTTCGCGATG CCGAACTGCA CGCTGCCGAG
GACAAGCGGC GGCGCGAACT GGTCGAACTT AAGAACCAGT CCGACAGCCT GGCATACCAG
AGCGAGAAGT CGCTGAACGA ACTCGGCGAT AAAGTCGATC CGGCGCTCAA GAGCCGCATC
GAAGGTCTGA TCAAGGACCT GCGCGAGGCG ATCAGCCAGG AGAATGAGAG CCGTATGCGG
TCAATCTCGG CGGAGTTGCA ACAGGCGATG TACCAGGTGT CGCAGAGCGC TTACACCGGC
GGCAACGGCG ACGGCGCGCG CAAAGGCAAG GACGAAGGGG TCGTCGAGGG CGAGTATACC
GTCGAGTGA
 
Protein sequence
MSKIVGIDLG TTNSVVAVME GGDPVVIPNA EGNRTTPSVV AFTKNGERLV GLTAKRQATI 
NPENTFYSIK RFIGRNFDET TVEREMVPFK VVKGPRNDVR VYAPTTGKEY APQEISAMVL
QKLKTDAEAY LGEPVTKAVI TVPAYFNDSQ RQATKDAGKI AGLEVLRIIN EPTAAALAYG
LDKKKDETIL VFDLGGGTFD VSVLEVGDGV VEVKATNGDT HLGGDDYDQR IVNWLIDEFR
KDQGIDLSKD RQALQRLKEA AEKAKIELSS MSETEINLPF ITADASGPKH LQMRLSRAKF
EQLTADLTER LKGPFFQALK DAGLKPNDLD EVVLVGGSTR MPVVIDLVRK LTGKEPNRSV
NPDEVVAIGA AIQAGVLGGD VKDVVLLDVT PLSLGVETLG GVMTKLIERN TTIPTRKSEI
FSTAADGQTA VDIHILQGER EMAADNMTLG RFRLEGIPPA PRGVPQIEVT FDIDANGILN
VSARDKATGK EQRITITAST NLSKEEIERM VRDAELHAAE DKRRRELVEL KNQSDSLAYQ
SEKSLNELGD KVDPALKSRI EGLIKDLREA ISQENESRMR SISAELQQAM YQVSQSAYTG
GNGDGARKGK DEGVVEGEYT VE