Gene Haur_0440 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_0440 
Symbol 
ID5732339 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp514873 
End bp516729 
Gene Length1857 bp 
Protein Length618 aa 
Translation table11 
GC content52% 
IMG OID641277566 
Productchaperone protein DnaK 
Protein accessionYP_001543219 
Protein GI159896972 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0443] Molecular chaperone 
TIGRFAM ID[TIGR02350] chaperone protein DnaK 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00195865 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGTAAAG TGATCGGTAT TGACCTTGGA ACAACCAATT CAGTGGTTGC GGTCATGGAA 
GGTGGCGAGG CGGTGGTGAT TCCTAACGCC GAAGGCGCTC GCACCACTCC TTCAATTGTT
GCCTTGAGCA AAAACGGTGA ACGCACGGTT GGCTTGGTTG CCAAGCGCCA ATCGGTCACC
AATCCCGAAA ATACAATTTA TTCGGTCAAG CGCTTTATTG GTCGTAAATT GGATCATCCC
AGCGTCCAAC GCGATAAAGA TTTGATTCCC TACCGCATGA CCAGCGCTCC CAATGGCGAT
GCGCGGGTCT TGATGGGCGG TCGCGATTAT TCGCCGCAAG AAGTTTCGGC CATGATTTTG
CAAAAACTCA AAGCCGATGC CGAAGCCTAT TTGGGTGAGC CTGTCAGCCA AGCGGTGATT
ACGGTTCCGG CCTATTTCGA TGATTCGCAG CGCCAAGCGA CCAAAGATGC TGGCAAAATT
GCTGGCCTTG AAGTATTGCG AATTATCAAC GAGCCAACCG CCAGCGCCTT GGCGTATGGC
TTGGAGCGCA ACAGTAACGA ATTAATTGTC GTCTATGACC TTGGTGGTGG TACGTTCGAT
GTTTCCATTT TGGAGCTTGG CGAGGGTGTG TTTGAGGTTC GGGCGACCAA CGGCGATACC
CACCTCGGCG GCGATGATTT TGATCAAAAG ATTATCGATT GGCTAGCCAG CGAGTTTCAA
CGCGAAAATA ATATCGATCT GCGTAGCGAC CGTATGGCAC TGCAACGTTT GAAAGAAGCT
TCGGAAAAAG CTAAGCAAGA ACTTTCGAGC GTGTTGCAAA CCGATATTTC GCTGCCATTT
ATCAGTGCCG ATGCCAGTGG CCCCAAACAC TTGAACACCA CTCTGACCCG TGCCAAACTT
GAGCAACTGA CCGCTGATTT GGTCGAACGC ACGCTCAAGC CAATCAAACT GGCCTTGCAA
GATGCTGGTT TGAAGCCAGG CGAAGTTGAT GAAGTGATTT TGGTTGGTGG CCAAACCCGC
ATGCCCGCAG TGCAGGCTGC GGTTAAAAAA TTCTTTGGCA AAGAGCCACA CAAGGGTGTA
AACCCTGATG AAGTGGTGGC GATTGGGGCT GCAATTCAGG CTGGCGTGCT GGCTGGTGAT
GTTACCGACG TGTTGTTGCT TGACGTAACG CCATTGACCT TGGGGATCGA AACCTATGGC
GGCGTGATGA CACCATTAAT TGATCGCAAC ACCACGATTC CAACCAAGCG CTCACAAATT
TTCTCAACTG CCAGCGACAA CCAAAACAGC GTTGAAATTC ATGTGTTGCA AGGCGAACGG
GCTGAAGCTC GACATAACAA ATCCTTGGCG CGTTTTACCC TCGATGGCAT TCCAGCCGCG
CCGCGTGGCG TGCCGCAAAT TGAAGTTATT TTTGATATTG ATGCCAACGG GATTGTCAAC
GTCAGCGCAA CCGATAAAGC TACCAACAAA GAGCAAAAAA TCACGATCAC GCCATCATCG
GGCTTGAATG ATGATGAAAT TTCAGCCATG ATTCGCGATG CCGAAGATCA TGCCGATAGC
GATGCTCGCC GCCGTGATCA GATTGCGACC CGCAACAAAG CCGATGGCGT GATCTACGCT
GCTGATCGGA TGTTGCGCGA AGCTGATGAT AACGTTGATT ACACTGCCCG CAACACGGTC
GAAGATCGGA TTGCAGCGCT ACGGGCGGTG CTTGATGGCG ATGATATGGA AGCGATCAAC
AATCGTACCG CTGAATTGAG CGTGGCAATG CAACGACTCA AACCAGCGCC CGATTTTGGC
ATCGAGCCAG AAACGCCGAG CCAAGACCAC GGCTCCGCCG ACGAGGTGGA ACTGTAA
 
Protein sequence
MGKVIGIDLG TTNSVVAVME GGEAVVIPNA EGARTTPSIV ALSKNGERTV GLVAKRQSVT 
NPENTIYSVK RFIGRKLDHP SVQRDKDLIP YRMTSAPNGD ARVLMGGRDY SPQEVSAMIL
QKLKADAEAY LGEPVSQAVI TVPAYFDDSQ RQATKDAGKI AGLEVLRIIN EPTASALAYG
LERNSNELIV VYDLGGGTFD VSILELGEGV FEVRATNGDT HLGGDDFDQK IIDWLASEFQ
RENNIDLRSD RMALQRLKEA SEKAKQELSS VLQTDISLPF ISADASGPKH LNTTLTRAKL
EQLTADLVER TLKPIKLALQ DAGLKPGEVD EVILVGGQTR MPAVQAAVKK FFGKEPHKGV
NPDEVVAIGA AIQAGVLAGD VTDVLLLDVT PLTLGIETYG GVMTPLIDRN TTIPTKRSQI
FSTASDNQNS VEIHVLQGER AEARHNKSLA RFTLDGIPAA PRGVPQIEVI FDIDANGIVN
VSATDKATNK EQKITITPSS GLNDDEISAM IRDAEDHADS DARRRDQIAT RNKADGVIYA
ADRMLREADD NVDYTARNTV EDRIAALRAV LDGDDMEAIN NRTAELSVAM QRLKPAPDFG
IEPETPSQDH GSADEVEL