Gene Daro_1954 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_1954 
SymbolhscA 
ID3567400 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp2105259 
End bp2107148 
Gene Length1890 bp 
Protein Length629 aa 
Translation table11 
GC content62% 
IMG OID637680425 
Productchaperone protein HscA 
Protein accessionYP_285170 
Protein GI71907583 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0443] Molecular chaperone 
TIGRFAM ID[TIGR01991] Fe-S protein assembly chaperone HscA 


Plasmid Coverage information

Num covering plasmid clones37 
Plasmid unclonability p-value0.296275 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.863 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTCTGT TTCAAATCGC CGAACCTGGC GAATCGGCGG CACCGCACGA GCACAAGCTC 
GCCATCGGCA TCGACCTGGG TACCACCAAC TCGCTGGTTG CCACCGTGCG CAGCGGCATC
GCTGTCTGCC TGAATGACGA TCAAGGGCGT CCGCTGCTGC CTTCCGTCGT GCGCTACCAC
GCCGACGGCA CGACCGAGGT TGGCTACGAT GCGCAGAAGA ATCAGGCGGT CGACCCGAAG
AACACGATTG TTTCGGTCAA GCGCTTCATG GGGCGGGGCA CCAAGGATAT TGCTCATGTT
GAGTCGATGC CTTACGACTT CGTCGAGTCG CCGGGCATGG TCAAGCTAAA AACCCACGCC
GGCATCAAGA GTCCGGTCGA GGTTTCGGCA GAAATCCTGA AATCCCTGAA GGCGAGGGCC
GAGGCGGCTC TCGGTGGCGA CCTGGTTGGG GCGGTGATCA CCGTGCCAGC CTATTTTGAC
GATGCCCAGC GGCAGGCGAC CAAGGATGCT GCCCGCCTGG CCGGCCTCAA TGTGCTGCGT
CTGCTCAACG AGCCGACCGC TGCTGCCATC GCCTATGGGC TGGACAACGG TGCGGAGGGC
GTCTACGCGG TCTACGACCT CGGTGGCGGC ACCTTCGATA TCTCCATTCT CAAGCTGACC
AAGGGCGTTT TCGAGGTCAT GTCCACCGGC GGCGATTCGG CCCTGGGCGG CGACGACTTC
GATCACCGCA TCTACTGCTG GGTCATTGAG CAGGCCGAAT TGCAACTGCT CTCGCCGGAA
GACGCGCGGC GCCTGATGAT GCGCTGCCGC GAGGCCAAGG AGTTCCTGAC CAACAATCCG
GAGGCGCCAA TCTCGCTGCG CCTGGCCTCC GGTGAGAACG TCGAGGTCAA ACTTGACGTT
GCCACTTTTG CCCAGATTAC CCAGACCCTG ATTTCCAAGA CCCTGCAGCC GGTCAAGAAA
GCACTACGCG ATGCTGGCCT GCGCGCCGAA GACATCAAGG GGGTGGTCAT GGTTGGCGGC
GCAACGCGCA TGCCGCAGGT GCAGAAGGCA GTCGGTGATT TCTTCCGCCA GGAACCGCTG
ACCAATCTCG ACCCGGACAA AGTCGTCGCC CTCGGTGCTG CGACGCAGGC CAACCTGCTG
GCCGGCAACA AGACCGGTCG TGACGATTGG CTGCTGCTCG ACGTCATTCC GCTGTCGCTC
GGCCTGGAAA CCATGGGCGG CCTGACTGAA AAGGTCATCC CGCGCAATTC GACCATCCCG
ACGGCTCGCG CTCAGGAATT CACCACCTTC AAGGACGGCC AGACGGCGAT GGTCATCCAT
GTCGTGCAGG GCGAGCGCGA AATGATCTCC GACTGCCGTT CTCTGGCCCG TTTCGAACTG
CGCGGTATTC CGCCGATGGT GGCCGGCGCA GCGCGTATTC GAGTCACTTT CCAGGTTGAT
GCCGACGGCC TGCTGTCGGT GACGGCCCGC GAACAGACGA CCGGCGTTGA ATCCAGCATT
ACCGTCAAGC CGTCTTATGG CCTGTCGGAT GACGAGATCG CCGGCATGCT CAAGGATTCG
ATGGAGCACG CCAAGGACGA CGCCATGACC CGCGCGCTGA AGGAAGCGCA GGTCGAGGCA
CAACGCATGA TCGAGGCAAC CGAGGCGGCG CTGGCTGAAG ACCCGCATCT GCTCAACGAG
GCGGAAACGG CCAAGATCAA GGCCACGATC GCCAAGCTGG CCGAAACCAT GGCTGGCGAA
AATCGTCGCC TGATCAATAT CGCGATGGAT GACCTCGGTT TCGAAACCCA GGCCTTTGCC
CATCGCCGCA TGGATCAGAG CATCAAAAAA GTGCTGTCCG GCCGCAACGT TGCCGACATC
AAGGTTGGAG AAGAAGGAGA AAAGGCATGA
 
Protein sequence
MALFQIAEPG ESAAPHEHKL AIGIDLGTTN SLVATVRSGI AVCLNDDQGR PLLPSVVRYH 
ADGTTEVGYD AQKNQAVDPK NTIVSVKRFM GRGTKDIAHV ESMPYDFVES PGMVKLKTHA
GIKSPVEVSA EILKSLKARA EAALGGDLVG AVITVPAYFD DAQRQATKDA ARLAGLNVLR
LLNEPTAAAI AYGLDNGAEG VYAVYDLGGG TFDISILKLT KGVFEVMSTG GDSALGGDDF
DHRIYCWVIE QAELQLLSPE DARRLMMRCR EAKEFLTNNP EAPISLRLAS GENVEVKLDV
ATFAQITQTL ISKTLQPVKK ALRDAGLRAE DIKGVVMVGG ATRMPQVQKA VGDFFRQEPL
TNLDPDKVVA LGAATQANLL AGNKTGRDDW LLLDVIPLSL GLETMGGLTE KVIPRNSTIP
TARAQEFTTF KDGQTAMVIH VVQGEREMIS DCRSLARFEL RGIPPMVAGA ARIRVTFQVD
ADGLLSVTAR EQTTGVESSI TVKPSYGLSD DEIAGMLKDS MEHAKDDAMT RALKEAQVEA
QRMIEATEAA LAEDPHLLNE AETAKIKATI AKLAETMAGE NRRLINIAMD DLGFETQAFA
HRRMDQSIKK VLSGRNVADI KVGEEGEKA