Gene Haur_4417 details


Gene Information

Locus tag: Haur_4417
Symbol:
ID: 5736268
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 5639571
End bp: 5646119
Gene Length: 6549 bp
Protein Length: 2182 aa
Translation table: 11
GC content: 52%
IMG OID: 641281580
Product: peptidase S8/S53 subtilisin kexin sedolisin
Protein accession: YP_001547177
Protein GI: 159900930
COG category:
COG ID:
TIGRFAM ID:
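
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database tool.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values copied from the record for Haur_4417.
length = gene_length_bp(5639571, 5646119)
print(length)                     # 6549
print(protein_length_aa(length))  # 2182
```

Both results match the Gene Length and Protein Length fields of the record.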


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCCAAAT TCCCCTTCAT TCGGCAAGCA GCCCTCATTG CAATGACGCT CTCATTGGGA 
CTTCAGGGTG CGGCGCAACC AGCCCGTGCC GAAGTAATGC GTGCTCCGAC TGCGACGACC
AAGTTCCTCG TTGAACCAAA CAGCCCAGCC AACAAATTTG TCCGCACGAA TGGCACATTG
ATTGCCGATT ATGGTTCTTT TTCAGTCTGG CAAGTTGCCA ATAACAACAC CAGCAGTCTT
AGTCAATTGG CTGGTGCACA AGCCGCCGAT TTGGATACGA TTGGCTTACG GGGAATCAGC
TTCAATCCTT TAAAGAGCGT TCCCACGCCG CAACCAAGCC TGAGCCAAAG CCCAACTGCC
GACCAACAAT TGTGGTTGGT GCAATTTGCG GGGCCGCTCA AGGATGCTTG GTTGGATGAA
CTAACCAAAG CTGGCGCGGA ACGCGTCATA TATATGCCCA ACAACGCTTA CCTCGTTTGG
GCAAATGGCC AAACTGTGGC CAAACTTGAT GCACTCAGCA AGAGCAATGG GGCGATTCAA
TGGGCTGGTG CCTACCACCC CGAATATCGC TTAGCTCCTG AGTTCCGCCA AAAAGCCAGC
AACCCTGATG CGAAGGGGCT GGTCGATGTT ACGATTCAGT TCTACAACAG TGCTACGGTT
GAACGTGATG TGCGCGAAGT GCTTGATGCC AGCGCCCAAG TGTACGCGAC CCCATGGCAG
GTGCTCAATT TCACTACAAT TTCGGTGCAA GTCAGCGAAG CCGCTTTGGC TCCAATTGCT
CGCCGCGCCA ATGTCTACAA CATTGAAGCG TGGGAAGCTC CTCAAAAAAT GGATGAGCGC
CAAGGCCAAA TTATTGCTGG TAACGTAACC ACCGCTGGTG GCAAAGTGGT GCCGAACGGC
CCAGGCTACT TAAGCTGGTT GCAAAGCCAA GGCGTGCCAA CCGACCCCAA CCAATATCCA
ATTGTCGATG TTGTCGATGA TGGTCTGGAT AATGGGACAA CCTCACCATT GCATCCTGAC
TTCTATGTCA ATGGGGTTCG GCCTGGTACT TCACGGATTA CAGCTGTTGG TAACTGTACT
GCCGATGCCA CTGGTAATGG TCAGGCAGGC CACGGTAACC TGAATATTGG GATCGTTGGT
AGCTACAACA ACCTGACTGG CTCGCCCCAC GTTGACGGTG CTAGCTCAGG GATTGCTGGC
GGCTATCGCG TTGGCTTAGG CATTTCGCCA TTCAGCCGAA TTGCCAGCAC CAAGATCTTC
AACAATGCTG GGAGCTTTGA TCTGACCAAT TGTGGTGGCG GTAGCAACTA CATTGGAATT
ATTGCAATGG CCTACAGCCA TGGCGCTGCC ATCTCGTCCA ACAGTTGGGG TTCAAACAGT
GGTGGTGCCT ATACCGCTTC ATCACAAGCC TTCGACCAAG GAACCCGCGA TGCAAGCAGC
ACCACTGCTG GCAACCAAGA AATGACTCAC ATCATTGCCG CTGGTAACGC TGGTTCAGGT
ACGAACACAG TTGGCTCACC TGGTACAGCC AAAAACGTTA TTACCGTAGG TGCAACCGAA
AACGTGCGCG ACGAAGGCGT GCTCGACGGC TGTAACGAAG GCAATGCTGA TAGTGCTGAT
GATATTGCCA GTTTCTCCAG CCGTGGTCCA ACCGATGATG CACGGGTAAA ACCTGACATC
ATGGCTCCCG GGACGCACGT GATGGGGCCA GCTTCACAAA TCGCCACCTT TGATGGGACT
GGCCTTTGTG GGCCAAGCAG TGGCACTCAC TACCCCAATG GTCAAACCCT CTATAGCTGG
TCGAGTGGTA CGAGCCACTC AACCCCAGCA GTTGCAGGCT CGGCTTCATT GCTCTACACC
AAATATCGCA CCAGCTTCGG CGGTGGGGTT GCTCCAAGCC CAGCAATGTT GAAAGCCTAT
ATGTTGGCCT CAACACGCTA CCTCAACGGG AATGGGACTG GCGGCACCTT GCCATCAAAT
AACCAAGGCT GGGGCGATGT AAACTTGGCT CCGGCCTTGG ACAGCACCCC ACGGATCGTG
GTTGACCAAA CCCGCACCTT CGGCGCAACG GGTGAAGAAT TTACCCAAGT TGGTCAGATT
GCCGATTCAG CCAAACCAAT TCGGATTACC TTGGCTTGGA CTGACGCAGT TGGCAGCACC
ACTGGCAACA GCTATGTCAA CGACCTTGAT CTCGAAGTGA CGGTTGGTGG CCAAACCTAC
AAAGGTAACG TCTTCAACGG CGCACTTTCA ACCACTGGCG GCACTGCTGA TAACAAGAAC
AACATTGAAG GTGTGTACTT GCCAGCAGGC ACAAGTGGCT CGATCCAAGT CAAAGTCATT
GCCCGCAACA TCGCTGGCGA TGCCATCCCA GGCAATGCTG ATACCACCGA CCAAGACTTT
GCCTTGTATG TCTACAACGG CTCGGTTGGC CCAACCGGTA CCTTGACTGG CCGCGTTGCC
CAAAGCAACA ACAACCCAGT TGTTGGCGCA ACGGTTCGCA CCAGCGGTGG CTCAAGCGCT
TTGACCAACG CCAGTGGTAA CTACAGCATG ATCGTACCAG TTGGTACCTA TGCGGTAACC
GCTAGCTTGC AAGGTGCTTT CCAAACTGTT CCAAGCGTAT CGATTACCGA AAATGTCACC
ACGACCCAAA ACTTTGTATT GAGCTACGGC TCAATTACAG GGACGGTACG TGATGCCTTT
ACCCCCAGCC GCCCAATTGT GGGAGCAAAT GTTTCGGTCA TTGGCTATAG CACCACAACC
AATGCTGCTG GTCAATACAC CATTCCAGTA GCCAATGTTG GCTCAACCTT GGTAACGGTC
AGTGCACCAA AATACGCCAC CCAATCACAA TCGGTAACGG TGGTTGATGC AGGCACGGCG
ACCCAAGACT TTACCTTGGC TGCTGGCGCG GTCGCAGGGA TTGTGACCAA CAGCGCTAAC
GGTGCACCAG TTGGCGATGC AACGGTCGCC ATCAGCAACC AATACGTCAA GACCAAAGCT
GACGGTAGCT TTGCCATTCG CTTGGAGCCA GGCAGTCATA CCCTGACCGC TTCAAAGATT
GGTTTCGCTG CTGATAGCGC CAGCTTGACG ATTAGCAATG GGGTTACGAC AACCCAAGAT
CTGGAAATTA CACCAGCTTT TGGTTATACC CCAGCCAGCT TGACCCGCAC CTTTACCTTT
GGTGATGCGC CATTCACCGA TACGGTTGGC TTAGAATTAA CCAATAATGG CACTCAGCCA
TTCACCTACA CCATTCGCGA AAATGGTGCG ACTGGCTTTA CCCCAGCGGT CAACCGCGCT
GGTGGCAACG TCTTGGTGGT TCAACGCAAC TCAGCAACCT CGGCAACAGC TGCAACTACG
GCTTTGACCG CGCTTGGCTA TACCTTTGAA TCAATCGATA ACATTGCATT CGAAGCCCGT
AGCTTGGCCA ATTTGCAAAC TTTTGATGCA GTACTCTTCC TTGGTTCAAC TAACGCAACC
GCCAATAATG CCAGCGAAAC CAAACTTACT GAATACCTCA ACGCAGGTGG TAAGCTGTTT
ATCGCTGACA ACGACCTTGG TTTCTTCACC AATGCAGGCA CGTTCTACCG AACCATGCTT
GACTCAACCT ATGGTGGTGA TGATCCAGGT GCTGCGAATC GGGCCTTGAC TGGCTTCGAT
TTCATGGCTG GCGTAAATGC TGCCTCAGTG GATGCATACC CTGACTTCTA TACTCCAGGT
AGCTCATCGA CGGTAATCTT CCGCTATGGC AATAACGCAG TTGGCGGTAG TTTCATCCAA
CGTAATGGCT ACAAAGCTGT CTACCTCGCA ACCGACTTCA TTAACTTGGG CACTGGTGCC
AGTGGCGAAG CGATCGAGCG CACCGTGCTT GAGCGTGTGT TGCAAGCAAT TGTTGGTAGT
GGCGATAGCA TTGCCTGGTT GCAGGAAGCT CCTGCGACTG GCGTAATTGC TGGTGGTGAT
AGCACCAACG TAGCAATTGG CTGGAATCCC GGTGTGATTA CCCAACCTGG CACCTACACT
GGCAGCCTGA ACCTTGGCTA CTCCTCAGTG ATTAGCCAAA CCTTCAATAT TCCAGTGACG
ATGGTGGTCA ACCCAGCCGC AACTCAAGCA CGAATTAGTG GTACGGTTAC CTCATCAGGG
GTCTGTGACA CTGTTCCAGC TCCAGCTGCT GGCGCTACAA TTTTGATCAC CTATACCAGC
GGGATGACTG CAACAGTTAC CACCAATGCC AATGGTGAAT ATGCCTTCTT CGTGCCTCAA
GGTGGCACCT ATACTGTTGC CGCTAGCGCC GTTGACCACC TTGGTCAATC GCAAAGCGTC
ACAGTTGCTG ATGGTGCTGA AGTTACCCAA AACTTCACCT TACGCTTGAA TAAGCCTTGT
GTCACGGTTA GCCCAGGCTC ATTGAGCGCT TCGCTCCAAT TGGGCGCTGC TCCGGTTAAC
CAAACCTTGG TTGTCACCAG CAAGGGTGCT GCCCCGCTCA ATGCAACCAT TAGTGAGCAA
AGCCGCAGCG CAATCAGCCA AGCTGGCTAT ACCATCTCGG AAGTTCCCTA TAGCTGGCTC
GAAGCCAACG ATGGGACGAA CTTGAACTTA ACCGATGATG CTGAAGCCAA CATTACTAGC
CCATTCACTG TAACCATCTT CAACACCAGC AGCCGTAACT TGCGGGTTGC TAACAATGGG
GTTGTCTTGG TCGGTGCAAC CACTGGTGAT GTAGCCGCAA CTAACGAATC CTTGCTAACT
GGCGCAACCA ACAATGTTAT CTCGCCATTC TGGGATGATG TTGATGATGA ATCAGGGGCA
ACCTACTGGA AAGTTGTTGG TACTGCGCCA AACCGCTCAT TGATCGTTCA GTGGGAAAAT
CGCCCGCACT ATAACGGTAT TGGTAATGCA ACCTTCCAAG TTGTAATCAA CGAACAAGGT
GGCATGATCT ACCAATATAA AGATCTCGAT TACGGCGATC CGTTGTACAA CAATGGTTTG
AGTGCAACCG TTGGTGTGCG TGGAGCAAGC ACGGCTCAAG CTGCTCAATT CAGCTTCAAC
CAAGCTCGCT TGCGCAGCGA AATGGCCTTG TGTGTTGGCG CTACCTGTGA TGCCATTTCA
TGGTTGACTG AAACGCCAAA CACGATTACT GGTTTGGCTG GTACGCCAGT CAGCACCCAA
GTCGTCAATG TGAGCTTCTC GACCAACGGT CTGACCCAAG CTGGGGTTTA TACCGGTAAC
TTGGTCTTGA ACCACAACGC TCCTCAACCA GTGGTAACTA TCCCTGTTAC TTTGACGGTT
ACTGCTCCAC CGAACTTCGG TACGATCAGT GGTACGATTC AAGGCTTAGC TGCATGTGAT
GTCTCACCTG CTCCATTAAG TGGCGCAACC GTGACGATCA ACACCACGCC AGCAACTGTC
TTGACCACCA GTGGTTCGGG GACATATAGC TATAGTTTGG CTGCTGGTAG CTACACGGTT
ACCGTTGCCA AGGCTGGCTA TGTCACCCAA ATCTACAACG TGACGGTTCC TACCGCTGGC
ACAGTGACCC AAAATGGTCA ACTGCGCTTG AATGCGCCAT GTTTGGATGT TGATACCACT
CCAATTAGCG AAACGGTTGC CATCAACAGC ATTACCACCC AAACCCTCAC CTTGGATAAC
AACGGTGCAG CGGCCTTGAC GTGGACAATC GCTGAATTGG CTGCAAGTGG TCGCCAAGCC
CCAAGTGGCG AAGCAGCACC AACCCGCAAG CCAGTTCCAG CCAACTTACG CTCTGCTGGT
AGCAAGCAAA CTGTTCCAGC CGATGAAGTT GTTCAAGATG GCAGCTTCGA GGAAACCAAT
GCTGGCGATA GTTCGAATCC AAACTGGAGC CAAACCTCAA CCAACTTCGG GGTTGTGCTC
TGTACCACCG GATGTTCAAC TGGCACCGCC GACGTTCCAC ACACTGGAAC TTGGTACGCT
TGGTTTGGTG GTCTGACCCC ATCACAAAGC CCACCAGATG AGTTGGGCAC GATCTCGCAA
ACCTTTACCG CTGCTGGCAA TGCCTCAGGT ACCTTATCGT TCTGGTTGTT TATGAACGAA
CAAAGCGGTG TTGATAGCGA CTATCTCAAG GTTAAGATCG ACGGTACTAC GGTCTTCACG
GTTACCAACG CAGAAGTTGC CGACTTCCCA ACCTACACGT TGGTCGAAGT TCCAATCAGC
GCAGCCGTTT TGGCCGATAC CACCAACCAT ACCTTGTTGT TTGAATCGTT TGTTGATGGC
AGTGGTAACA CCAACTTCTT CCTCGACGAT GTTTCACTCG ATATTGGTGG CGGTGCATCA
TGTGTTGCTG ATGCTGCTCC ATGGGTTCGC GCAGTACCAA ATAGCGGTAC GATTGCAGCC
GATGGCTCAG CCAATGTGGC ACTTGAATTC AACACCAATG GCTTGGCCGC TGGTCTCCAC
ACAGTCAACC TGTGTATCCA AACCAACGAC ACAACCCGAC CAAATGTTCA AATTCCAGTC
ACTATCAACG TGACTGAGAA TGTTGACCCA ACCCGTAAGC TCTACTTACC AATGGTCAGC
AAGAACTAA
 
Protein sequence
MAKFPFIRQA ALIAMTLSLG LQGAAQPARA EVMRAPTATT KFLVEPNSPA NKFVRTNGTL 
IADYGSFSVW QVANNNTSSL SQLAGAQAAD LDTIGLRGIS FNPLKSVPTP QPSLSQSPTA
DQQLWLVQFA GPLKDAWLDE LTKAGAERVI YMPNNAYLVW ANGQTVAKLD ALSKSNGAIQ
WAGAYHPEYR LAPEFRQKAS NPDAKGLVDV TIQFYNSATV ERDVREVLDA SAQVYATPWQ
VLNFTTISVQ VSEAALAPIA RRANVYNIEA WEAPQKMDER QGQIIAGNVT TAGGKVVPNG
PGYLSWLQSQ GVPTDPNQYP IVDVVDDGLD NGTTSPLHPD FYVNGVRPGT SRITAVGNCT
ADATGNGQAG HGNLNIGIVG SYNNLTGSPH VDGASSGIAG GYRVGLGISP FSRIASTKIF
NNAGSFDLTN CGGGSNYIGI IAMAYSHGAA ISSNSWGSNS GGAYTASSQA FDQGTRDASS
TTAGNQEMTH IIAAGNAGSG TNTVGSPGTA KNVITVGATE NVRDEGVLDG CNEGNADSAD
DIASFSSRGP TDDARVKPDI MAPGTHVMGP ASQIATFDGT GLCGPSSGTH YPNGQTLYSW
SSGTSHSTPA VAGSASLLYT KYRTSFGGGV APSPAMLKAY MLASTRYLNG NGTGGTLPSN
NQGWGDVNLA PALDSTPRIV VDQTRTFGAT GEEFTQVGQI ADSAKPIRIT LAWTDAVGST
TGNSYVNDLD LEVTVGGQTY KGNVFNGALS TTGGTADNKN NIEGVYLPAG TSGSIQVKVI
ARNIAGDAIP GNADTTDQDF ALYVYNGSVG PTGTLTGRVA QSNNNPVVGA TVRTSGGSSA
LTNASGNYSM IVPVGTYAVT ASLQGAFQTV PSVSITENVT TTQNFVLSYG SITGTVRDAF
TPSRPIVGAN VSVIGYSTTT NAAGQYTIPV ANVGSTLVTV SAPKYATQSQ SVTVVDAGTA
TQDFTLAAGA VAGIVTNSAN GAPVGDATVA ISNQYVKTKA DGSFAIRLEP GSHTLTASKI
GFAADSASLT ISNGVTTTQD LEITPAFGYT PASLTRTFTF GDAPFTDTVG LELTNNGTQP
FTYTIRENGA TGFTPAVNRA GGNVLVVQRN SATSATAATT ALTALGYTFE SIDNIAFEAR
SLANLQTFDA VLFLGSTNAT ANNASETKLT EYLNAGGKLF IADNDLGFFT NAGTFYRTML
DSTYGGDDPG AANRALTGFD FMAGVNAASV DAYPDFYTPG SSSTVIFRYG NNAVGGSFIQ
RNGYKAVYLA TDFINLGTGA SGEAIERTVL ERVLQAIVGS GDSIAWLQEA PATGVIAGGD
STNVAIGWNP GVITQPGTYT GSLNLGYSSV ISQTFNIPVT MVVNPAATQA RISGTVTSSG
VCDTVPAPAA GATILITYTS GMTATVTTNA NGEYAFFVPQ GGTYTVAASA VDHLGQSQSV
TVADGAEVTQ NFTLRLNKPC VTVSPGSLSA SLQLGAAPVN QTLVVTSKGA APLNATISEQ
SRSAISQAGY TISEVPYSWL EANDGTNLNL TDDAEANITS PFTVTIFNTS SRNLRVANNG
VVLVGATTGD VAATNESLLT GATNNVISPF WDDVDDESGA TYWKVVGTAP NRSLIVQWEN
RPHYNGIGNA TFQVVINEQG GMIYQYKDLD YGDPLYNNGL SATVGVRGAS TAQAAQFSFN
QARLRSEMAL CVGATCDAIS WLTETPNTIT GLAGTPVSTQ VVNVSFSTNG LTQAGVYTGN
LVLNHNAPQP VVTIPVTLTV TAPPNFGTIS GTIQGLAACD VSPAPLSGAT VTINTTPATV
LTTSGSGTYS YSLAAGSYTV TVAKAGYVTQ IYNVTVPTAG TVTQNGQLRL NAPCLDVDTT
PISETVAINS ITTQTLTLDN NGAAALTWTI AELAASGRQA PSGEAAPTRK PVPANLRSAG
SKQTVPADEV VQDGSFEETN AGDSSNPNWS QTSTNFGVVL CTTGCSTGTA DVPHTGTWYA
WFGGLTPSQS PPDELGTISQ TFTAAGNASG TLSFWLFMNE QSGVDSDYLK VKIDGTTVFT
VTNAEVADFP TYTLVEVPIS AAVLADTTNH TLLFESFVDG SGNTNFFLDD VSLDIGGGAS
CVADAAPWVR AVPNSGTIAA DGSANVALEF NTNGLAAGLH TVNLCIQTND TTRPNVQIPV
TINVTENVDP TRKLYLPMVS KN
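
The gene and protein sequences above can be spot-checked against each other by translating the first and last codons of the gene sequence. The sketch below assumes that the amino acid assignments of translation table 11 match the standard genetic code, and that the 10-base blocks are separated only by whitespace. The `translate` helper is hypothetical and handles only unambiguous A/C/G/T codons.

```python
from itertools import product

# Standard codon assignments in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, ignoring whitespace between blocks."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # drop any trailing partial codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First 30 bases and last 9 bases of the gene sequence in this record.
print(translate("ATGGCCAAAT TCCCCTTCAT TCGGCAAGCA"))  # MAKFPFIRQA
print(translate("AAGAACTAA"))                         # KN*
```

The first ten residues and the final two residues agree with the protein sequence listed in the record, and the gene ends in a TAA stop codon.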