Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_0683 |
Symbol | |
ID | 5732584 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 784033 |
End bp | 786465 |
Gene Length | 2433 bp |
Protein Length | 810 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641277813 |
Product | ATP-dependent protease La |
Protein accession | YP_001543459 |
Protein GI | 159897212 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0466] ATP-dependent Lon protease, bacterial type |
TIGRFAM ID | [TIGR00763] ATP-dependent protease La |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0018045 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCCGAG AAAAAGACCG CCCTCGTACA GGTACCGAGC GCACTTTACC GTTGGTGGTA TTGGGCGAAA TTGTGATCAT GCCGCACATG ACTGTGCCGC TCCAGGTTGG GCAGGGCAAG TCTTATCGGG CCATGGAACA AGCCATGGAA GACGATCAAC ATGTTTTGTT GATCTTTGTT TCTGAGGCTG AAATTGAAGC TTATAAGGGC CACGAACCCC AACAACTACC CAAAGTTGGC GTGGTCGCAC GCTTGGAAGA TTTCTCCCAA TTGCCCGATG GCACGGTAAA AATTGTGCTC GAAGGGATTA CCCGCGCTGA GATTGTTGAT TGTGTTCAGA GCGATCCCTT CTATCGGGTA GCTTGCCGCT ACTTGCCCGA TCAGGAACCC AAAGGCATCG AAGTCGATGC ATTGATGGAT ACGGTCAAAC AACAAATTAC TGAATTTGTC GATTATTTAG GCGAAATTCC TCAAGAAGCA GTTGCCTTTG TGCACCGAAT CACGACACCT GGCCATTTGG CCGATTTGGT GACTTATGGC CCAGCCTTCT CATTCCAAGA TCGCTTGGAA TTGTTGAATG AAATGGAGCC ATTGGCTCGT TTGAATCGGG TGCAAGTGAT TTTGGCCCGT CAATTAGAGC TGTTGCGGCT CCGCGCCAAA ATTCAATCCG ACACCAAAGA AGTGCTCGAT CAAGGCCAAA AAGAGTATTT CTTACGCGAA CAAATGCGGG TGATTCGCCG TGAACTAGGC GAAGATGACG ATGGCGATGA TCCAATTGAT GAATTGCGCC GCAAAGTCAA CGAACTCAAC GCGCCGCAAT ATGTCAAAGA TCAAGCCTTG CACGAAATTA AGCGCCTCGC TCAACAAGGC ATGAATTCGC CCGAAGCTGG CGTGATTCGT ACCTACCTTG ATTGGATTAT CGCTTTGCCG TGGAATGCCG ATCAAGTCGC CGCGATTTCG CTGGTGCAAT CGCGCCAAGT GCTTGATGAA GATCATTATG GCTTGGAAAA AGTCAAAGAG CGAATTTTGG AATATCTGGC GGTGCGCAAG CTCGCTGGTA GCAAAATGCG TTCGCCAATT TTGTGCTTCG TCGGCCCGCC AGGCGTGGGT AAAACCAGCC TTGGTCGCTC GATTGCTCGG GCCTTGGATC GTCCATTCGT GCGCCAGTCG CTGGGTGGGG TACACGACGA GGCCGAAATT CGCGGTCACC GCCGAACTTA CATCGGAGCC ATGCCAGGCC GGATTATTCA AGGCATGAAA ACCGCCAAAT CGCGTCAAGC AGTCTTTATG CTTGATGAAA TCGATAAAAT TGGCAATGAT TTCCGTGGTG ACCCAACTTC GGCATTGCTA GAAGTGCTTG ATCCTGAGCA AAACAACACG TTCTCAGATC ACTATTTGGA AATTCCATTT GATTTGAGCC AAGTGGTGTT TGTGGCAACC GCTAACCAAC TTGAGCCAAT TCCAGCGCCG CTGCGCGACC GGATGGAAAT TATCGAGATT GGCGGCTATA CCGAGGACGA AAAACTAGCG ATTGCTCAAG GCTTCTTGCT GCCCAAGCAA CGCGAGTTCC ATGGACTCGA AAGCAGCCAG TTGGAATTAA CCGATGCTGC GATCTTGAAG TTGATTCGCG AATATACCCG CGAAGCTGGT GTGCGTAACC TTGAGCGCGA AGTTGCGGCC TTGTGTCGCA AAATTGCCCG TAAGGTAGCT GAATCGCAGG ATGAAGAAGC ACCACAAGAA GGCAAGAAGC GCCGTAAGAA GGCCAAAAAA GCCGAGCCAA CCAAGTTCTT GATTGACGCT GCTGATGTGC CAATCTACCT TGGACCTGAA CATTTCAGCT TTGGAATGGC CGAAGTTTCC GATCAAATTG GGGTCGCGAC TGGAGTTGCT TGGACACCAA CGGGCGGCGA TATTCTTTCC TTCGAAGTTT TGCCATTGAC TGGCAAAGGC GAATTGCGCC TCACTGGTCA ATTGGGCGAT GTGATGAAGG AATCGGCCCA AGCAGCCATG TCGTATGTGC GATATCGCGC CAAAGAGTTG GGCATCGAGC CAAATTACTT CGACGAGCAT TCGATCCATA TTCACGTGCC CGAGGGTGCA GTGCCCAAAG ATGGCCCATC GGCTGGGATT ACATTGACGA TTGCGTTGAT CTCCGCGATG ACTGGCCGAG CGGTGCGCCG CGATGTGGCA ATGACAGGCG AAGTCACCTT GCGCGGGCGG GTCTTGCCGA TTGGTGGCTT GAAAGAAAAA ACATTGGCCG CTCATCGTGC AGGCATCAAA ACTTTTATCT TACCCAAGGA AAACGCCAAA GATATTGTTG ATCTGCCTGA AAAAGTGCGC CAAGATCTGC AATTGATTCC GGTCGAAACC ATGGATGAAG TGCTGACAAT TGCATTGATG CCATTTATTC GCCAAGACGT GATCGCCAGC TAA
|
Protein sequence | MTREKDRPRT GTERTLPLVV LGEIVIMPHM TVPLQVGQGK SYRAMEQAME DDQHVLLIFV SEAEIEAYKG HEPQQLPKVG VVARLEDFSQ LPDGTVKIVL EGITRAEIVD CVQSDPFYRV ACRYLPDQEP KGIEVDALMD TVKQQITEFV DYLGEIPQEA VAFVHRITTP GHLADLVTYG PAFSFQDRLE LLNEMEPLAR LNRVQVILAR QLELLRLRAK IQSDTKEVLD QGQKEYFLRE QMRVIRRELG EDDDGDDPID ELRRKVNELN APQYVKDQAL HEIKRLAQQG MNSPEAGVIR TYLDWIIALP WNADQVAAIS LVQSRQVLDE DHYGLEKVKE RILEYLAVRK LAGSKMRSPI LCFVGPPGVG KTSLGRSIAR ALDRPFVRQS LGGVHDEAEI RGHRRTYIGA MPGRIIQGMK TAKSRQAVFM LDEIDKIGND FRGDPTSALL EVLDPEQNNT FSDHYLEIPF DLSQVVFVAT ANQLEPIPAP LRDRMEIIEI GGYTEDEKLA IAQGFLLPKQ REFHGLESSQ LELTDAAILK LIREYTREAG VRNLEREVAA LCRKIARKVA ESQDEEAPQE GKKRRKKAKK AEPTKFLIDA ADVPIYLGPE HFSFGMAEVS DQIGVATGVA WTPTGGDILS FEVLPLTGKG ELRLTGQLGD VMKESAQAAM SYVRYRAKEL GIEPNYFDEH SIHIHVPEGA VPKDGPSAGI TLTIALISAM TGRAVRRDVA MTGEVTLRGR VLPIGGLKEK TLAAHRAGIK TFILPKENAK DIVDLPEKVR QDLQLIPVET MDEVLTIALM PFIRQDVIAS
|
| |