Gene Haur_4652 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_4652 
Symbol 
ID5736499 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp5942748 
End bp5945708 
Gene Length2961 bp 
Protein Length986 aa 
Translation table11 
GC content53% 
IMG OID641281816 
Productpeptidase S8/S53 subtilisin kexin sedolisin 
Protein accessionYP_001547411 
Protein GI159901164 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1404] Subtilisin-like serine proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0586324 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTGGCGCG TTTTAACAAT CGGCCTGCTC GTTTTTGGCC TTTTACCCTC GGTTAGTGCC 
CGAACAACCC TGCCCATTCA GTTTGTTATG CCAATTGCCA GCGTAACCAA TCGCGGCCTA
AGCTTCGATA ATCGTGGCTT GCCTCAACGC ACGCTTCAGT TGTACGAGGC TTGGCCTGCA
ACCCAAGCCA AACGCCAGCC TGATCCGTTG TTGCGGGTGG CTGTGCCAAG CTTTCCCCAA
AAAATTGACC CAAATTTACA AAACTATTGG CATGATGCTC CGAGCCAGCC GCAAACGCTG
CTGGTATTTT TGGCCGAGCA AGCCGATTTG AGCTTGGCCA GCACCTTTGA TGATTGGGCT
GCACGTGGTG ATTACGTCTA CAAAACCCTG ACTGACCATG CCCAACATAG CCAAACCCCT
TTGCTAAACG CATTACGGGC GCAAGGCCAC AACCCACAAT CGTTGTGGAT TGTCAATAGT
TTGATTGTTG AGGGTGATCA GCAGTTGGCC TTGGATCTGG CGCAACATCC AGCGGTTGCT
AGCATTGGGG CTAACCATGT TTTTAATCTC CCAAGCGTAG CGACGACTCT TGTAACTGAG
CCTGAGAATG TGGCGTGGGG GGTGGCGGCG GTTGATGCGC CCCATGTTTG GTCTGATTGG
GGCGTGCGTG GTCAGGGGAT TGTGGTTGCC AATATTGATA CTGGCGTTAC CGTAAGCCAT
ACAGCTTTGC TGAATAATTA TCGTGGTTGG TCAGCCAATG GCTTGAGCAA TGATTACAAC
TGGTTTGATC CGCTGTATCA GTATCGTTTG CCAACCGATC CGGCGGGCCA TGGTACGCAC
ACGATGGGCA GTTTGGTGGG AGCGAATGAC CAGCAGGGCA TGGCCTTGGG TGTTGCACCA
GCCGCTCGTT GGATTGCCGC CCGCGCGTGT GGGGCGTTGA CCTGCGATGA CTTGAGTTTG
ATCAGAAGTG CCCAATGGAT GCTTGCCCCA ACCCGGGTTG GCTGCGAACG CAATCAGCAA
ATTGCTTGCG ATCCGCGGCC CGATTTACGT CCACACATCA TCAATAATTC GTGGGGTGGG
CCAGGCGAAA GCACGTGGTA CAGCGGGTAT ATTACTGCGT GGGATGCAGC AGGCATTTTG
AGTGTGTTTG CGGCGGGCAA TTTTGGGCGC TCTGGCTGCT ATACGAGCAC TGCACCTGGT
AATAATGCCA ATGTGTTCAG TGTTGGTGCT GTCGATATTA ATAATCTGAT CGCCGATTTT
TCTTCGCGTG GGCCAACCAG CGATGGTCGT ACCAACCCCG ATTTGAGTGC GCCAGGTGTG
CGTGTTCCTT CAGCTTGGCC GAATGGATCG ACAGCCTTGC TCGATGGCAC ATCGATGGCG
GCTCCTCATG TGAGCGGTAT TGCTGCGCTA ATTTGGTCGG CGAACCCGCA GTATATTGGC
GATTTAGCTG CAACCCAAGC CTTGTTGACG AACACAAGCG AAGCGCGTTA CTCGGCCCAA
TGTGGCGATG CGCCAACAGC ACGGCCCAAT AATGTGTATG GTTGGGGCAG TGCCGATGCC
TATGCAGCGG TACGTCAGGC ACGAGTTGAT GTGGCTTGGC TGAGTTTGCC TGAGCAGTTG
CTTGTTCCAG CGAATACCCT CGTGACGATT CCGATCACTT TGGATACCCG CCAAGTCAGC
GCAGCGGGGA GCTATCGGGC GAATGTTTTG GTGGTCGCTA GTTCAGGTAC AAACACCTTT
GAATTAGAAC TGATCGTCGA GGCCGCAGCC AATACTAGCC AATTTACCGG CCAATTAGTT
GATCGGTGGC ATGGACGTGG GGTGTATGGG CGGGTCAGCA TTGGTGGCGG GCCTTCTAGT
TATACCGATC CAACCGGCCA TTACACCATG ACCCTGACAA CCAGCAGCCA TGAGATCTCG
ACCCAAGCTA CTGGCTATCA TCCAGCGGCT ACGATGGTTG ATTTAAATCT CCAGCAGACC
AATGTGCTGA CATTAACGCC TGATATTCCG CATATGCTGG CTGAAATTCC GCCGATCAGT
GCTAGCTTGG CTTTTGCTGA ACAACGCACA TTTGCGGTAA CCTTGACCAA TGCGGGCACT
CAGCCGCTAG TCGTTTCGCC GCATGTGCCA AATCAGGAAT GGCAGATTAC CCCAGTCCCG
AGAACCGCCC TATACGATAC AACTGGCTTG GCCGAATTAA AGCTCGATGA TGACCAAGTG
TATACCGATG CCTTGGATTT GGGCTTTAGT GCGCCATTGT TTGGCACTTT GGCAAACAAG
GTTTATCTGA GTTCAAATGG CTGGGTTTCA TTGAATCAAA CGCGGAGTGC TGCCCCTAGC
GCTAACTGCT TTCCGGCCAA CAATTTGCCC AATGCCACGC TTGCCCCCTT CTGGACTGAT
CTTGATCCTT CAGAGGGTGG CATTATCCGT GCTGGAAGGG TTGATGCTGA TACGTTTGTG
GCGAGTTATG AACAGGTTCC AATCTGGCAA GAAGAACATC TTCCCACGGC TGCCCCAACC
TACACCTTCC AATTAATTAT TGAGCGTAGT GGGCAGGTTG AGTATCGTTA TGGGGCGATG
GGCTACTTTC CAGGCCGCTG GGGAGTTGGC ACGCATACCA ATAGTAGTGT TGGGCAGGCT
TTGGGTTGCC ATCAAAGCCA TGAATATTTG GCGGCCCACA ATTGGCAATT GCTCAATCAG
CCAAGTAGTC AGCAATGGTT GAGTGCTACG CCAAGTAGCC TGACGATCGC GCCCAATCAA
CAAGCCACCG TGTTGGTTCA GCTTAAGGGC TTTGGGGCAA TCAGTTGGTT GCAACATCCT
GCGGTCAGCA TTGTGCAGAT CAACAGCAAC GATCCGCGCC AGCCGCAACG TGAAATAACC
GCGAGCGTAG GGTTGCAACT AGCGCCTTAT CAAACCTATG CCAATACGAT TGTGATTAGT
AATCCATTAG CAAACCCTTG A
 
Protein sequence
MWRVLTIGLL VFGLLPSVSA RTTLPIQFVM PIASVTNRGL SFDNRGLPQR TLQLYEAWPA 
TQAKRQPDPL LRVAVPSFPQ KIDPNLQNYW HDAPSQPQTL LVFLAEQADL SLASTFDDWA
ARGDYVYKTL TDHAQHSQTP LLNALRAQGH NPQSLWIVNS LIVEGDQQLA LDLAQHPAVA
SIGANHVFNL PSVATTLVTE PENVAWGVAA VDAPHVWSDW GVRGQGIVVA NIDTGVTVSH
TALLNNYRGW SANGLSNDYN WFDPLYQYRL PTDPAGHGTH TMGSLVGAND QQGMALGVAP
AARWIAARAC GALTCDDLSL IRSAQWMLAP TRVGCERNQQ IACDPRPDLR PHIINNSWGG
PGESTWYSGY ITAWDAAGIL SVFAAGNFGR SGCYTSTAPG NNANVFSVGA VDINNLIADF
SSRGPTSDGR TNPDLSAPGV RVPSAWPNGS TALLDGTSMA APHVSGIAAL IWSANPQYIG
DLAATQALLT NTSEARYSAQ CGDAPTARPN NVYGWGSADA YAAVRQARVD VAWLSLPEQL
LVPANTLVTI PITLDTRQVS AAGSYRANVL VVASSGTNTF ELELIVEAAA NTSQFTGQLV
DRWHGRGVYG RVSIGGGPSS YTDPTGHYTM TLTTSSHEIS TQATGYHPAA TMVDLNLQQT
NVLTLTPDIP HMLAEIPPIS ASLAFAEQRT FAVTLTNAGT QPLVVSPHVP NQEWQITPVP
RTALYDTTGL AELKLDDDQV YTDALDLGFS APLFGTLANK VYLSSNGWVS LNQTRSAAPS
ANCFPANNLP NATLAPFWTD LDPSEGGIIR AGRVDADTFV ASYEQVPIWQ EEHLPTAAPT
YTFQLIIERS GQVEYRYGAM GYFPGRWGVG THTNSSVGQA LGCHQSHEYL AAHNWQLLNQ
PSSQQWLSAT PSSLTIAPNQ QATVLVQLKG FGAISWLQHP AVSIVQINSN DPRQPQREIT
ASVGLQLAPY QTYANTIVIS NPLANP