Gene Haur_0683 details

Gene Information

Locus tag: Haur_0683
Symbol:
ID: 5732584
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 784033
End bp: 786465
Gene Length: 2433 bp
Protein Length: 810 aa
Translation table: 11
GC content: 51%
IMG OID: 641277813
Product: ATP-dependent protease La
Protein accession: YP_001543459
Protein GI: 159897212
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0466] ATP-dependent Lon protease, bacterial type
TIGRFAM ID: [TIGR00763] ATP-dependent protease La
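
The coordinates and lengths listed above are consistent with each other, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the gene length includes the stop codon; the function names are hypothetical and chosen only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(784033, 786465)
print(length_bp)                  # 2433
print(protein_length(length_bp))  # 810
```

Both values match the Gene Length and Protein Length fields of this record.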


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0018045
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCCGAG AAAAAGACCG CCCTCGTACA GGTACCGAGC GCACTTTACC GTTGGTGGTA 
TTGGGCGAAA TTGTGATCAT GCCGCACATG ACTGTGCCGC TCCAGGTTGG GCAGGGCAAG
TCTTATCGGG CCATGGAACA AGCCATGGAA GACGATCAAC ATGTTTTGTT GATCTTTGTT
TCTGAGGCTG AAATTGAAGC TTATAAGGGC CACGAACCCC AACAACTACC CAAAGTTGGC
GTGGTCGCAC GCTTGGAAGA TTTCTCCCAA TTGCCCGATG GCACGGTAAA AATTGTGCTC
GAAGGGATTA CCCGCGCTGA GATTGTTGAT TGTGTTCAGA GCGATCCCTT CTATCGGGTA
GCTTGCCGCT ACTTGCCCGA TCAGGAACCC AAAGGCATCG AAGTCGATGC ATTGATGGAT
ACGGTCAAAC AACAAATTAC TGAATTTGTC GATTATTTAG GCGAAATTCC TCAAGAAGCA
GTTGCCTTTG TGCACCGAAT CACGACACCT GGCCATTTGG CCGATTTGGT GACTTATGGC
CCAGCCTTCT CATTCCAAGA TCGCTTGGAA TTGTTGAATG AAATGGAGCC ATTGGCTCGT
TTGAATCGGG TGCAAGTGAT TTTGGCCCGT CAATTAGAGC TGTTGCGGCT CCGCGCCAAA
ATTCAATCCG ACACCAAAGA AGTGCTCGAT CAAGGCCAAA AAGAGTATTT CTTACGCGAA
CAAATGCGGG TGATTCGCCG TGAACTAGGC GAAGATGACG ATGGCGATGA TCCAATTGAT
GAATTGCGCC GCAAAGTCAA CGAACTCAAC GCGCCGCAAT ATGTCAAAGA TCAAGCCTTG
CACGAAATTA AGCGCCTCGC TCAACAAGGC ATGAATTCGC CCGAAGCTGG CGTGATTCGT
ACCTACCTTG ATTGGATTAT CGCTTTGCCG TGGAATGCCG ATCAAGTCGC CGCGATTTCG
CTGGTGCAAT CGCGCCAAGT GCTTGATGAA GATCATTATG GCTTGGAAAA AGTCAAAGAG
CGAATTTTGG AATATCTGGC GGTGCGCAAG CTCGCTGGTA GCAAAATGCG TTCGCCAATT
TTGTGCTTCG TCGGCCCGCC AGGCGTGGGT AAAACCAGCC TTGGTCGCTC GATTGCTCGG
GCCTTGGATC GTCCATTCGT GCGCCAGTCG CTGGGTGGGG TACACGACGA GGCCGAAATT
CGCGGTCACC GCCGAACTTA CATCGGAGCC ATGCCAGGCC GGATTATTCA AGGCATGAAA
ACCGCCAAAT CGCGTCAAGC AGTCTTTATG CTTGATGAAA TCGATAAAAT TGGCAATGAT
TTCCGTGGTG ACCCAACTTC GGCATTGCTA GAAGTGCTTG ATCCTGAGCA AAACAACACG
TTCTCAGATC ACTATTTGGA AATTCCATTT GATTTGAGCC AAGTGGTGTT TGTGGCAACC
GCTAACCAAC TTGAGCCAAT TCCAGCGCCG CTGCGCGACC GGATGGAAAT TATCGAGATT
GGCGGCTATA CCGAGGACGA AAAACTAGCG ATTGCTCAAG GCTTCTTGCT GCCCAAGCAA
CGCGAGTTCC ATGGACTCGA AAGCAGCCAG TTGGAATTAA CCGATGCTGC GATCTTGAAG
TTGATTCGCG AATATACCCG CGAAGCTGGT GTGCGTAACC TTGAGCGCGA AGTTGCGGCC
TTGTGTCGCA AAATTGCCCG TAAGGTAGCT GAATCGCAGG ATGAAGAAGC ACCACAAGAA
GGCAAGAAGC GCCGTAAGAA GGCCAAAAAA GCCGAGCCAA CCAAGTTCTT GATTGACGCT
GCTGATGTGC CAATCTACCT TGGACCTGAA CATTTCAGCT TTGGAATGGC CGAAGTTTCC
GATCAAATTG GGGTCGCGAC TGGAGTTGCT TGGACACCAA CGGGCGGCGA TATTCTTTCC
TTCGAAGTTT TGCCATTGAC TGGCAAAGGC GAATTGCGCC TCACTGGTCA ATTGGGCGAT
GTGATGAAGG AATCGGCCCA AGCAGCCATG TCGTATGTGC GATATCGCGC CAAAGAGTTG
GGCATCGAGC CAAATTACTT CGACGAGCAT TCGATCCATA TTCACGTGCC CGAGGGTGCA
GTGCCCAAAG ATGGCCCATC GGCTGGGATT ACATTGACGA TTGCGTTGAT CTCCGCGATG
ACTGGCCGAG CGGTGCGCCG CGATGTGGCA ATGACAGGCG AAGTCACCTT GCGCGGGCGG
GTCTTGCCGA TTGGTGGCTT GAAAGAAAAA ACATTGGCCG CTCATCGTGC AGGCATCAAA
ACTTTTATCT TACCCAAGGA AAACGCCAAA GATATTGTTG ATCTGCCTGA AAAAGTGCGC
CAAGATCTGC AATTGATTCC GGTCGAAACC ATGGATGAAG TGCTGACAAT TGCATTGATG
CCATTTATTC GCCAAGACGT GATCGCCAGC TAA
 
Protein sequence
MTREKDRPRT GTERTLPLVV LGEIVIMPHM TVPLQVGQGK SYRAMEQAME DDQHVLLIFV 
SEAEIEAYKG HEPQQLPKVG VVARLEDFSQ LPDGTVKIVL EGITRAEIVD CVQSDPFYRV
ACRYLPDQEP KGIEVDALMD TVKQQITEFV DYLGEIPQEA VAFVHRITTP GHLADLVTYG
PAFSFQDRLE LLNEMEPLAR LNRVQVILAR QLELLRLRAK IQSDTKEVLD QGQKEYFLRE
QMRVIRRELG EDDDGDDPID ELRRKVNELN APQYVKDQAL HEIKRLAQQG MNSPEAGVIR
TYLDWIIALP WNADQVAAIS LVQSRQVLDE DHYGLEKVKE RILEYLAVRK LAGSKMRSPI
LCFVGPPGVG KTSLGRSIAR ALDRPFVRQS LGGVHDEAEI RGHRRTYIGA MPGRIIQGMK
TAKSRQAVFM LDEIDKIGND FRGDPTSALL EVLDPEQNNT FSDHYLEIPF DLSQVVFVAT
ANQLEPIPAP LRDRMEIIEI GGYTEDEKLA IAQGFLLPKQ REFHGLESSQ LELTDAAILK
LIREYTREAG VRNLEREVAA LCRKIARKVA ESQDEEAPQE GKKRRKKAKK AEPTKFLIDA
ADVPIYLGPE HFSFGMAEVS DQIGVATGVA WTPTGGDILS FEVLPLTGKG ELRLTGQLGD
VMKESAQAAM SYVRYRAKEL GIEPNYFDEH SIHIHVPEGA VPKDGPSAGI TLTIALISAM
TGRAVRRDVA MTGEVTLRGR VLPIGGLKEK TLAAHRAGIK TFILPKENAK DIVDLPEKVR
QDLQLIPVET MDEVLTIALM PFIRQDVIAS
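
The gene and protein sequences above can be cross-checked by translating the DNA and by computing its GC fraction. The following is a minimal sketch, not the method used to produce this record. It assumes the sequence is given on the coding strand, uses the standard codon assignments (translation table 11 assigns the same amino acids to codons as the standard code and differs in its permitted start codons), and ignores alternative start codons because this gene begins with ATG. The helper names are hypothetical.

```python
# Codon table in NCBI order: bases T, C, A, G at each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the 10-letter block layout."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        index = (16 * BASES.index(codon[0])
                 + 4 * BASES.index(codon[1])
                 + BASES.index(codon[2]))
        residue = AMINO_ACIDS[index]
        if residue == "*":  # stop codon
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence in this record.
first_line = "ATGACCCGAG AAAAAGACCG CCCTCGTACA GGTACCGAGC GCACTTTACC GTTGGTGGTA"
print(translate(first_line))  # MTREKDRPRTGTERTLPLVV
```

The printed residues match the first twenty residues of the protein sequence. Passing the full gene sequence to translate and gc_fraction would allow the same comparison against the complete protein sequence and the GC content field.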