Gene Haur_0703 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_0703 
Symbol 
ID5732604 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp808733 
End bp811258 
Gene Length2526 bp 
Protein Length841 aa 
Translation table11 
GC content53% 
IMG OID641277833 
Productpeptidase U32 
Protein accessionYP_001543479 
Protein GI159897232 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0826] Collagenase and related proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGAAC AACGCCGTTT TTCCAAGCCC GAAGTGATGA GTCCAGCGGG CTATTGGCCG 
CAGTTGCATG CCGCAATCGA AGCAGGTGCA GATGCGGTTT ATTTTGGTTT GAAGCATTTT
ACCGCCCGCG CCAAGGTGGG TTTTACGCTC AGCGAGTTGC CCGACGTAAT GCGTACCCTG
CATCAACGTG GTGTGCGCGG CTTTGTGACC TTCAATACCT TGGTGTTTGA ACACGAATTG
CGCGAAGCCA CCCGCTCGTT GGCAGCAATT GCCGAGGCTG GAGCCGATGC GGTGATTGTG
CAAGATATTG GCATCGCTCA GCTTGCCCGC CAAATCGCCC CCGAAATGGA AGTGCATGGC
AGCACTCAAA TGAGCATCAC CAGCGCAGAA GGGGTTGAAT TAGCGCGTCG TTTCGGAGCC
AATCGGGTAG TTTTGGCCCG TGAATTATCG CTGGCTGAAA TTGCCGCAAT TCGCCAAGCC
ACCGATTGCG AGCTAGAAAT GTTTGTCCAT GGAGCCTTGT GCGTCTCGTA TTCGGGCCAA
TGTTTCTCCT CGGAAGCATG GGGCGGGCGT AGTGCCAACC GCGGCCAATG CGCTCAAGCC
TGTCGGTTGC CCTACCAATT ATTGGTTGAT GGCAAGCACA AACCCTTGGC CGATGCACGG
TATTTGCTCT CGCCAGGCGA TTTGTATGCT GTGCCCTTGA TGCACGATAT TATCAAATTG
GGTATTTCGT CGCTGAAAAT CGAAGGCCGC TACAAAGATG CCGATTATGT GGCCTTGACT
ACCAGCGCCT ATCGCAAGGC GGTTGATGAG GCTTGGGCTG GCTTGCCCTT GAGCATTACA
CCCGCCGAAG AGTTGCAATT AGAGCAAGTC TATTCGCGTG GCTTAGGCAC ATTTTTCATC
GGCGGAACCA ATCATCAAAC CGTAGTCAAT GGCCGTGCAC CGCGTCACCG TGGCGTATTG
ATCGGTACTG TCAGCGATAT CTATGGCGAA CGGATTGTGC TCAAGCCCCA CGAAAATCAC
GCCATCGCCC CGCTCAAGGC TGGCGATGGC GTGGTCTTTG ATGCCGCCAA CTGGCGCAGC
CCTGAAGAAC GTGAAGAAGG TGGGCGAATT TATGGCATCG AAGCCTTGCC AACTGGCGAA
ATTGAGCTAC GTTTCGGTCC GCAAGCAATC AAGCCTAACC GCATTCGCAT TGGCGATTTG
ATTTGGCGTA GCCACGATCC TGAGCTTGAT CGCGCTGCCA AGCCGTTTTT AGAGGCCGCC
GCGCCTGTCG CCAAACAACC TTTGCAGGTG CATGTTGTAG CCCACGAAGG CCAGCCTTTA
CAATTAACCT GGACGGTCAG CAATTATCCT CAATTCAGCG TAACCGTGCA ATCGCCTGAG
CCATTGCCAC GCTCGCAGAA CCGCGCCTTG ACCAACGAAT TTCTGTTTGA TCAAGCTAGC
CGTTTGGGTA ATACCGCCTA CAGTTTGGCC GAAATGAATG TTGATAGTAG TGGCCAGCCG
TTTGTGCCCA GCTCGTTGCT GAATAACCTG CGCCGCCAAG CAATCGAACA ACTTAGTGCC
TTGCAAGCCG CCACGCCAAC GCGCCAGATT CAAGCGCCAC TTGAGGTTTT GGCCCAAGCA
CAGCAAACTG TTCAAGCAAC TCCAAGCCTC AGCAATACGC CTCAATTGCA TATTTTGGTA
CGCACGCCTG AGCAATTACA AGCTGCCATC GCCGCCAAAC CGGACAGCAT CACCCTCGAT
TATTTGGAAT TGTATGGCTT GAAGCCAGCA GTCGAGTTAG TCCAAGCTCA TGGAATTACG
GTGCGCGTGG CTAGCCCACG GGTGCTCAAA CCTAGCGAAC AACGAATTAT TCACTTTTTG
CTCAAGCTTG GCTGCGAAAT TGTAGTACGT TCAACGGGCT TACTCGAAGC CTTGCGCCAA
GAGCAACACC CAGCCTTGAT TGGCGATTTT AGTCTGAACG CCGCCAATAG CATTACCGCC
AACACCCTGC TCGAATTAGG CTTAGAGCGG ATTACGCCGA CCCACGATTT GAATGCGGCC
CAAGTTGCGG CCTTGGCTGA GCAATTGGGC AGTGCAGCGG TTGAAGTGAT CGCCTATCAA
CATTTGCCTG TATTCCACAC CGAGCATTGT GTGTTTTGCC GCTTCCTCTC AAATGGTACG
AGCTTCAAAG ATTGTGGCCA TCCCTGCGAG AAGCATCGGG TCGAGTTACG CGATCTGAAT
GGGCGGGCGC ATCCAGTAAT GGCTGATGTT GGCTGTCGCA ATACGGTGTT TGGTGCTGAA
GCACAAACGG CTGTGGAGCA TCTTGATGCT TGGCGAGCTG CTGGTATTGT GCATTATCGC
TTGGAATTTG TGCACGAAAC GGCTGAACAA GTAGCGGCAA TTATTGCAGC CTTTGATCGC
ACCTTGCTCG GCCAACAACC CAGCAGTGTT TTGGCCGAGC AATTACGCAA GGCAGCGCCG
CAGGGCATCA CCCAAGGCAG TTTGTTTATT CCCAATGATT ATCTGCGCGT ACCATTGATG
CAGTAG
 
Protein sequence
MSEQRRFSKP EVMSPAGYWP QLHAAIEAGA DAVYFGLKHF TARAKVGFTL SELPDVMRTL 
HQRGVRGFVT FNTLVFEHEL REATRSLAAI AEAGADAVIV QDIGIAQLAR QIAPEMEVHG
STQMSITSAE GVELARRFGA NRVVLARELS LAEIAAIRQA TDCELEMFVH GALCVSYSGQ
CFSSEAWGGR SANRGQCAQA CRLPYQLLVD GKHKPLADAR YLLSPGDLYA VPLMHDIIKL
GISSLKIEGR YKDADYVALT TSAYRKAVDE AWAGLPLSIT PAEELQLEQV YSRGLGTFFI
GGTNHQTVVN GRAPRHRGVL IGTVSDIYGE RIVLKPHENH AIAPLKAGDG VVFDAANWRS
PEEREEGGRI YGIEALPTGE IELRFGPQAI KPNRIRIGDL IWRSHDPELD RAAKPFLEAA
APVAKQPLQV HVVAHEGQPL QLTWTVSNYP QFSVTVQSPE PLPRSQNRAL TNEFLFDQAS
RLGNTAYSLA EMNVDSSGQP FVPSSLLNNL RRQAIEQLSA LQAATPTRQI QAPLEVLAQA
QQTVQATPSL SNTPQLHILV RTPEQLQAAI AAKPDSITLD YLELYGLKPA VELVQAHGIT
VRVASPRVLK PSEQRIIHFL LKLGCEIVVR STGLLEALRQ EQHPALIGDF SLNAANSITA
NTLLELGLER ITPTHDLNAA QVAALAEQLG SAAVEVIAYQ HLPVFHTEHC VFCRFLSNGT
SFKDCGHPCE KHRVELRDLN GRAHPVMADV GCRNTVFGAE AQTAVEHLDA WRAAGIVHYR
LEFVHETAEQ VAAIIAAFDR TLLGQQPSSV LAEQLRKAAP QGITQGSLFI PNDYLRVPLM
Q