Gene Information |
Locus tag | Haur_0703 |
Symbol | |
ID | 5732604 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 808733 |
End bp | 811258 |
Gene Length | 2526 bp |
Protein Length | 841 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641277833 |
Product | peptidase U32 |
Protein accession | YP_001543479 |
Protein GI | 159897232 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0826] Collagenase and related proteases |
TIGRFAM ID | |
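The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions on the replicon and that the CDS includes its stop codon (the listed gene sequence ends in TAG). The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS that includes one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return cds_length_bp // 3 - 1


# Values copied from the Gene Information fields of this record.
length = gene_length_bp(808733, 811258)
print(length)                     # 2526
print(protein_length_aa(length))  # 841
```

Under these assumptions both results agree with the Gene Length (2526 bp) and Protein Length (841 aa) fields. The strand does not affect the length calculation.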
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGAGCGAAC AACGCCGTTT TTCCAAGCCC GAAGTGATGA GTCCAGCGGG CTATTGGCCG CAGTTGCATG CCGCAATCGA AGCAGGTGCA GATGCGGTTT ATTTTGGTTT GAAGCATTTT ACCGCCCGCG CCAAGGTGGG TTTTACGCTC AGCGAGTTGC CCGACGTAAT GCGTACCCTG CATCAACGTG GTGTGCGCGG CTTTGTGACC TTCAATACCT TGGTGTTTGA ACACGAATTG CGCGAAGCCA CCCGCTCGTT GGCAGCAATT GCCGAGGCTG GAGCCGATGC GGTGATTGTG CAAGATATTG GCATCGCTCA GCTTGCCCGC CAAATCGCCC CCGAAATGGA AGTGCATGGC AGCACTCAAA TGAGCATCAC CAGCGCAGAA GGGGTTGAAT TAGCGCGTCG TTTCGGAGCC AATCGGGTAG TTTTGGCCCG TGAATTATCG CTGGCTGAAA TTGCCGCAAT TCGCCAAGCC ACCGATTGCG AGCTAGAAAT GTTTGTCCAT GGAGCCTTGT GCGTCTCGTA TTCGGGCCAA TGTTTCTCCT CGGAAGCATG GGGCGGGCGT AGTGCCAACC GCGGCCAATG CGCTCAAGCC TGTCGGTTGC CCTACCAATT ATTGGTTGAT GGCAAGCACA AACCCTTGGC CGATGCACGG TATTTGCTCT CGCCAGGCGA TTTGTATGCT GTGCCCTTGA TGCACGATAT TATCAAATTG GGTATTTCGT CGCTGAAAAT CGAAGGCCGC TACAAAGATG CCGATTATGT GGCCTTGACT ACCAGCGCCT ATCGCAAGGC GGTTGATGAG GCTTGGGCTG GCTTGCCCTT GAGCATTACA CCCGCCGAAG AGTTGCAATT AGAGCAAGTC TATTCGCGTG GCTTAGGCAC ATTTTTCATC GGCGGAACCA ATCATCAAAC CGTAGTCAAT GGCCGTGCAC CGCGTCACCG TGGCGTATTG ATCGGTACTG TCAGCGATAT CTATGGCGAA CGGATTGTGC TCAAGCCCCA CGAAAATCAC GCCATCGCCC CGCTCAAGGC TGGCGATGGC GTGGTCTTTG ATGCCGCCAA CTGGCGCAGC CCTGAAGAAC GTGAAGAAGG TGGGCGAATT TATGGCATCG AAGCCTTGCC AACTGGCGAA ATTGAGCTAC GTTTCGGTCC GCAAGCAATC AAGCCTAACC GCATTCGCAT TGGCGATTTG ATTTGGCGTA GCCACGATCC TGAGCTTGAT CGCGCTGCCA AGCCGTTTTT AGAGGCCGCC GCGCCTGTCG CCAAACAACC TTTGCAGGTG CATGTTGTAG CCCACGAAGG CCAGCCTTTA CAATTAACCT GGACGGTCAG CAATTATCCT CAATTCAGCG TAACCGTGCA ATCGCCTGAG CCATTGCCAC GCTCGCAGAA CCGCGCCTTG ACCAACGAAT TTCTGTTTGA TCAAGCTAGC CGTTTGGGTA ATACCGCCTA CAGTTTGGCC GAAATGAATG TTGATAGTAG TGGCCAGCCG TTTGTGCCCA GCTCGTTGCT GAATAACCTG CGCCGCCAAG CAATCGAACA ACTTAGTGCC TTGCAAGCCG CCACGCCAAC GCGCCAGATT CAAGCGCCAC TTGAGGTTTT GGCCCAAGCA CAGCAAACTG TTCAAGCAAC TCCAAGCCTC AGCAATACGC CTCAATTGCA TATTTTGGTA CGCACGCCTG AGCAATTACA AGCTGCCATC GCCGCCAAAC CGGACAGCAT CACCCTCGAT TATTTGGAAT TGTATGGCTT GAAGCCAGCA GTCGAGTTAG TCCAAGCTCA TGGAATTACG 
GTGCGCGTGG CTAGCCCACG GGTGCTCAAA CCTAGCGAAC AACGAATTAT TCACTTTTTG CTCAAGCTTG GCTGCGAAAT TGTAGTACGT TCAACGGGCT TACTCGAAGC CTTGCGCCAA GAGCAACACC CAGCCTTGAT TGGCGATTTT AGTCTGAACG CCGCCAATAG CATTACCGCC AACACCCTGC TCGAATTAGG CTTAGAGCGG ATTACGCCGA CCCACGATTT GAATGCGGCC CAAGTTGCGG CCTTGGCTGA GCAATTGGGC AGTGCAGCGG TTGAAGTGAT CGCCTATCAA CATTTGCCTG TATTCCACAC CGAGCATTGT GTGTTTTGCC GCTTCCTCTC AAATGGTACG AGCTTCAAAG ATTGTGGCCA TCCCTGCGAG AAGCATCGGG TCGAGTTACG CGATCTGAAT GGGCGGGCGC ATCCAGTAAT GGCTGATGTT GGCTGTCGCA ATACGGTGTT TGGTGCTGAA GCACAAACGG CTGTGGAGCA TCTTGATGCT TGGCGAGCTG CTGGTATTGT GCATTATCGC TTGGAATTTG TGCACGAAAC GGCTGAACAA GTAGCGGCAA TTATTGCAGC CTTTGATCGC ACCTTGCTCG GCCAACAACC CAGCAGTGTT TTGGCCGAGC AATTACGCAA GGCAGCGCCG CAGGGCATCA CCCAAGGCAG TTTGTTTATT CCCAATGATT ATCTGCGCGT ACCATTGATG CAGTAG
|
Protein sequence | MSEQRRFSKP EVMSPAGYWP QLHAAIEAGA DAVYFGLKHF TARAKVGFTL SELPDVMRTL HQRGVRGFVT FNTLVFEHEL REATRSLAAI AEAGADAVIV QDIGIAQLAR QIAPEMEVHG STQMSITSAE GVELARRFGA NRVVLARELS LAEIAAIRQA TDCELEMFVH GALCVSYSGQ CFSSEAWGGR SANRGQCAQA CRLPYQLLVD GKHKPLADAR YLLSPGDLYA VPLMHDIIKL GISSLKIEGR YKDADYVALT TSAYRKAVDE AWAGLPLSIT PAEELQLEQV YSRGLGTFFI GGTNHQTVVN GRAPRHRGVL IGTVSDIYGE RIVLKPHENH AIAPLKAGDG VVFDAANWRS PEEREEGGRI YGIEALPTGE IELRFGPQAI KPNRIRIGDL IWRSHDPELD RAAKPFLEAA APVAKQPLQV HVVAHEGQPL QLTWTVSNYP QFSVTVQSPE PLPRSQNRAL TNEFLFDQAS RLGNTAYSLA EMNVDSSGQP FVPSSLLNNL RRQAIEQLSA LQAATPTRQI QAPLEVLAQA QQTVQATPSL SNTPQLHILV RTPEQLQAAI AAKPDSITLD YLELYGLKPA VELVQAHGIT VRVASPRVLK PSEQRIIHFL LKLGCEIVVR STGLLEALRQ EQHPALIGDF SLNAANSITA NTLLELGLER ITPTHDLNAA QVAALAEQLG SAAVEVIAYQ HLPVFHTEHC VFCRFLSNGT SFKDCGHPCE KHRVELRDLN GRAHPVMADV GCRNTVFGAE AQTAVEHLDA WRAAGIVHYR LEFVHETAEQ VAAIIAAFDR TLLGQQPSSV LAEQLRKAAP QGITQGSLFI PNDYLRVPLM Q
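The sequences above are printed in space-separated blocks of 10 characters, so they need to be joined before any length or composition check. The following is a minimal sketch, assuming plain A/C/G/T input; the helper names are hypothetical. The example runs only on the first three blocks of the gene sequence, not on the full gene, so its result should not be read as the record's GC content.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace (block separators, line breaks) and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in an already cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First three blocks of the gene sequence listed above (illustration only).
first_blocks = "ATGAGCGAAC AACGCCGTTT TTCCAAGCCC"
seq = clean_sequence(first_blocks)
print(len(seq))  # 30
print(gc_fraction(seq))
```

Applying the same two helpers to the full gene sequence would give a length to compare against the Gene Length field and a GC fraction to compare against the GC content field.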