Gene Information |
Locus tag | Haur_0440 |
Symbol | |
ID | 5732339 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 514873 |
End bp | 516729 |
Gene Length | 1857 bp |
Protein Length | 618 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641277566 |
Product | chaperone protein DnaK |
Protein accession | YP_001543219 |
Protein GI | 159896972 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0443] Molecular chaperone |
TIGRFAM ID | [TIGR02350] chaperone protein DnaK |
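
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is only a sanity check. It assumes the coordinates are 1-based and inclusive, and that the stop codon is excluded from the reported protein length. Neither convention is stated in the record, and the variable names are illustrative.

```python
# Values copied from the record above.
start_bp = 514873
end_bp = 516729
gene_length_bp = 1857
protein_length_aa = 618

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
computed_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted as a residue.
computed_protein_length = computed_length // 3 - 1

length_ok = computed_length == gene_length_bp
in_frame = computed_length % 3 == 0
protein_ok = computed_protein_length == protein_length_aa

print(length_ok, in_frame, protein_ok)
```

Under these assumptions all three checks agree with the record: 1857 bp is a whole number of codons (619), and dropping the stop codon leaves 618 residues.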
| |
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.00195865 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGTAAAG TGATCGGTAT TGACCTTGGA ACAACCAATT CAGTGGTTGC GGTCATGGAA GGTGGCGAGG CGGTGGTGAT TCCTAACGCC GAAGGCGCTC GCACCACTCC TTCAATTGTT GCCTTGAGCA AAAACGGTGA ACGCACGGTT GGCTTGGTTG CCAAGCGCCA ATCGGTCACC AATCCCGAAA ATACAATTTA TTCGGTCAAG CGCTTTATTG GTCGTAAATT GGATCATCCC AGCGTCCAAC GCGATAAAGA TTTGATTCCC TACCGCATGA CCAGCGCTCC CAATGGCGAT GCGCGGGTCT TGATGGGCGG TCGCGATTAT TCGCCGCAAG AAGTTTCGGC CATGATTTTG CAAAAACTCA AAGCCGATGC CGAAGCCTAT TTGGGTGAGC CTGTCAGCCA AGCGGTGATT ACGGTTCCGG CCTATTTCGA TGATTCGCAG CGCCAAGCGA CCAAAGATGC TGGCAAAATT GCTGGCCTTG AAGTATTGCG AATTATCAAC GAGCCAACCG CCAGCGCCTT GGCGTATGGC TTGGAGCGCA ACAGTAACGA ATTAATTGTC GTCTATGACC TTGGTGGTGG TACGTTCGAT GTTTCCATTT TGGAGCTTGG CGAGGGTGTG TTTGAGGTTC GGGCGACCAA CGGCGATACC CACCTCGGCG GCGATGATTT TGATCAAAAG ATTATCGATT GGCTAGCCAG CGAGTTTCAA CGCGAAAATA ATATCGATCT GCGTAGCGAC CGTATGGCAC TGCAACGTTT GAAAGAAGCT TCGGAAAAAG CTAAGCAAGA ACTTTCGAGC GTGTTGCAAA CCGATATTTC GCTGCCATTT ATCAGTGCCG ATGCCAGTGG CCCCAAACAC TTGAACACCA CTCTGACCCG TGCCAAACTT GAGCAACTGA CCGCTGATTT GGTCGAACGC ACGCTCAAGC CAATCAAACT GGCCTTGCAA GATGCTGGTT TGAAGCCAGG CGAAGTTGAT GAAGTGATTT TGGTTGGTGG CCAAACCCGC ATGCCCGCAG TGCAGGCTGC GGTTAAAAAA TTCTTTGGCA AAGAGCCACA CAAGGGTGTA AACCCTGATG AAGTGGTGGC GATTGGGGCT GCAATTCAGG CTGGCGTGCT GGCTGGTGAT GTTACCGACG TGTTGTTGCT TGACGTAACG CCATTGACCT TGGGGATCGA AACCTATGGC GGCGTGATGA CACCATTAAT TGATCGCAAC ACCACGATTC CAACCAAGCG CTCACAAATT TTCTCAACTG CCAGCGACAA CCAAAACAGC GTTGAAATTC ATGTGTTGCA AGGCGAACGG GCTGAAGCTC GACATAACAA ATCCTTGGCG CGTTTTACCC TCGATGGCAT TCCAGCCGCG CCGCGTGGCG TGCCGCAAAT TGAAGTTATT TTTGATATTG ATGCCAACGG GATTGTCAAC GTCAGCGCAA CCGATAAAGC TACCAACAAA GAGCAAAAAA TCACGATCAC GCCATCATCG GGCTTGAATG ATGATGAAAT TTCAGCCATG ATTCGCGATG CCGAAGATCA TGCCGATAGC GATGCTCGCC GCCGTGATCA GATTGCGACC CGCAACAAAG CCGATGGCGT GATCTACGCT GCTGATCGGA TGTTGCGCGA AGCTGATGAT AACGTTGATT ACACTGCCCG CAACACGGTC GAAGATCGGA TTGCAGCGCT ACGGGCGGTG CTTGATGGCG ATGATATGGA AGCGATCAAC AATCGTACCG CTGAATTGAG CGTGGCAATG CAACGACTCA AACCAGCGCC CGATTTTGGC 
ATCGAGCCAG AAACGCCGAG CCAAGACCAC GGCTCCGCCG ACGAGGTGGA ACTGTAA
|
Protein sequence | MGKVIGIDLG TTNSVVAVME GGEAVVIPNA EGARTTPSIV ALSKNGERTV GLVAKRQSVT NPENTIYSVK RFIGRKLDHP SVQRDKDLIP YRMTSAPNGD ARVLMGGRDY SPQEVSAMIL QKLKADAEAY LGEPVSQAVI TVPAYFDDSQ RQATKDAGKI AGLEVLRIIN EPTASALAYG LERNSNELIV VYDLGGGTFD VSILELGEGV FEVRATNGDT HLGGDDFDQK IIDWLASEFQ RENNIDLRSD RMALQRLKEA SEKAKQELSS VLQTDISLPF ISADASGPKH LNTTLTRAKL EQLTADLVER TLKPIKLALQ DAGLKPGEVD EVILVGGQTR MPAVQAAVKK FFGKEPHKGV NPDEVVAIGA AIQAGVLAGD VTDVLLLDVT PLTLGIETYG GVMTPLIDRN TTIPTKRSQI FSTASDNQNS VEIHVLQGER AEARHNKSLA RFTLDGIPAA PRGVPQIEVI FDIDANGIVN VSATDKATNK EQKITITPSS GLNDDEISAM IRDAEDHADS DARRRDQIAT RNKADGVIYA ADRMLREADD NVDYTARNTV EDRIAALRAV LDGDDMEAIN NRTAELSVAM QRLKPAPDFG IEPETPSQDH GSADEVEL
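
As a rough illustration of how the gene sequence relates to the protein sequence, the sketch below translates the first 30 nucleotides and the final 57 nucleotides of the gene sequence shown above. It builds the codon table from the standard compact amino acid string, which translation table 11 shares with the standard code for sense and stop codons. The function name `translate` is hypothetical, and the tail fragment is assumed to begin on a codon boundary.

```python
from itertools import product

# Codon order TTT, TTC, TTA, TTG, TCT, ... paired with the amino acid string
# used by translation table 11 (same assignments as the standard code).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon; '*' marks a stop codon."""
    dna = dna.replace(" ", "").upper()
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna) - 2, 3))


# Fragments copied from the gene sequence in the record (spaces are ignored).
head = "ATGGGTAAAG TGATCGGTAT TGACCTTGGA"
tail = "ATCGAGCCAG AAACGCCGAG CCAAGACCAC GGCTCCGCCG ACGAGGTGGA ACTGTAA"

head_protein = translate(head)
tail_protein = translate(tail)
print(head_protein)  # MGKVIGIDLG
print(tail_protein)  # IEPETPSQDHGSADEVEL*
```

Both fragments match the start and end of the protein sequence listed in the record, with the trailing `*` corresponding to the TAA stop codon.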