Gene Haur_4418 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_4418 
Symbol 
ID5736269 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp5646522 
End bp5653037 
Gene Length6516 bp 
Protein Length2171 aa 
Translation table11 
GC content50% 
IMG OID641281581 
Productpeptidase S8/S53 subtilisin kexin sedolisin 
Protein accessionYP_001547178 
Protein GI159900931 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTCGAT TTTCTTTGGT GCGCCAACTG GTTTTGGCGT TGATGCTGGT GGGTTTGGCA 
TTGCCAGCTG GGCAACCAAG CGCCGCCCAA ACTTCGGCAA ACCCGCAAAC ATTCAGCAAA
CTGTTGATCG ATACTGGTAG CTCAGCGAGC CGTTTCGCCC AAACTCATGG CAAATTGCTG
GTTGATTATG GGGCATTTGG TTTGTGGCAA ATTGCCGATA ATCAACTGAA CCAGGTTAAG
CAATTGGCTG GTGCGACAAC CAACGATCTA GATACGCTTA ACTTTCGTGG GATTCGCTTC
AATCCACTGA AAGCCCAACC AAGCCTCAGA TTAACCCAAA GCCCAACCGC TGAGCATCAA
TTGTGGTTGG TACAATTTAT CGGGCCAATC AAAGATACTT GGTTGGAGCA ACTCACCAAG
GCTGGAGCCG AATTGGTGAT TTATGTGCCG AGCAATAGCT ATTTGGTGTG GGCTGATGGC
GCGAGCCTCA ACAAGCTCAA TCAATTACAA CAAACCAGCC ACGCTATTCA ATGGATGGGT
GTCTATCAGC CTGAATATCG TTTGGCTCCA GAACTACGCA CTAAGGCAAG TAGCCCCAAG
CGGACTGAAT TGGTTGATGT CAGCGTGCAA ATCTACAATG CTGGCGATAT ACAAGCCTCG
GTTGATGCAG TCATTGCGGC GAGTAGCAAA CTGCATGCCC GACCATGGCA GGTGCTCAAT
TTCACCACAC TTTCGGTCCA ACTGCCTGAA ACTGAATTGG CAGCGTTGGC ACAACGAGCG
AATATCTACA ATATTGAGCC TTGGAGTGAG CCAGAGCTGT TTGATGAACG CCAAGGCCAG
ATTATCGCGG GCAATGTGAC GACGCTGAAT GGCAAAACTG TGCCAAGTGG GCCTGGCTAT
TTAAGCTGGT TGCAAAGCCA AGGCTTGCCC AACAACCCTG CCGAGTATCC AATTGTTGAC
GTGGTCGATG ATGGCTTCGA TGATGGCACG ATCAATCCAT TGCATCCTGA TTTTTATCTA
AACGGCGTGC GACCTGGAAC TTCGCGCATC ACCGCCACAT CCAACTGTAC TCTTGACCCG
CGCGGCAATA GTTTGGCGGG TCACGGCCAA ATCAACGCGG GAATTATCGG CGGCTACAAC
AATCTCACAG GCTTTCCCTA TATCGATGAA GCTAGCTCTG GGATTGCTGG CGGCTATAAT
ATTGGGCTTG GAATTGCGCC CTATACCCGT ATGTCCAGCA CCAAAATCTT TCGTAATAGT
GGCTCGTTTA GCATTAGCAA TTGTGATGGT GGCGATAGCT ACGCCGATAT TGTGACTGCT
GGTTATGAGC ATCAAGCGGC GATTACCTCG AATAGTTGGG GCCAGCCAGG CTCAATGGGC
GCGTACAACA TTTATTCACA GCTTTACGAC CAACTAACCC GTGACGCGAG CAGCGACGAT
GCGGGCAACC AAGCCATGTT GCATATTTTT GCCGCAGGCA ATATGGGTGA ATTTGGGGCA
AATACGGTTA GCGCACCGAG CACTGCCAAA AACGTGATTT CGGTCGGTGC AACCGAAAAT
GTGCGCGATG AAGGCGTTTT GGATGGCAAC GGTTGTGAGA TTTCGCAAGC CGATAATGCT
GATGATTTGG CGGTTTTTTC GGGCAAAGGC CCAACCGATG ATAGCCGAAT CAAGCCCGAC
ATTATGGCTC CTGGCACACA CGTAATTGGC CCAGCCCCCC AAGAATCGGG CTTTCTGGGC
GCTGGGGTTT GTGGTGGCCT AACTAATCCA TATTATCCTG ATAATCAAAC CCTTTACACC
TGGTCAAGCG GTACCAGCCA CTCAGCTCCC GCTGTGTCAG GCGCAGCTTC ATTGCTCTAC
ACCAAATATC GTACAAGCTT TGGCAATGGA GCAACCCCAA GCCCAGCTAT GCTTAAGGCC
TATTTGTTGG CTTCGAGCCG CTATTTGGAT GGGGTGAATA CTGGCGGCAA TTTGCCAACC
AATCAGCAAG GTTGGGGCGA TGTCTACCTG AAAACTGCAC TCGATAGCAC CCCTAGAATT
GTGGTTGATC AAAGCCATGT GTTTGGGGCG AGCGGCGAAA GTTTCAGCCA AGTTGGCCAA
ATTGCCGATA GCAACAAAGC GTTGCGGGTC GCCCTCACAT GGACAGATGC AGCGGGCGGT
ACTACTGGCG ATGCCTTTGT CAATGACCTT GATCTGGAAG TGACGGTTGG CGGCCAAGTT
TATAAAGGAA ATGTGTTTAA TGAAACACTC TCAACGACTG GCGGCGTGGC CGATGCCAAA
AACAATGTCG AATCGGTTTA TCTGCCAGCA GGGGCAAGTG GCGCGATTCA GGTGCGGGTG
ATTGCCCGCA ATATTGCTGG CGATGCCATT CCTGGTAATG CTGATACTAC CGACCAAGAT
TTTGCCCTGT ATGTCTACAA TGCTAGCCAA GGCGCGATTG GCACAGTTAC GGGGCGGGTG
AGCAACGCCA GCAATGCTCC CGTTGCTAAT GTGCGAATTG CCACCAGCAA TAATTTAAGC
ACCGTGAGCG ATGCCAATGG GAACTATCGT TTGGTTCTAC CTGTTGGCAC CTATGCTCTG
ACCGCAAGCA TAAATGGCGT ATTTCAAAGT GTGCCAGCGC TCACCATTGG CCAAAATGCC
CAAATTAGCC AAAACTTTAC CTTGGTGTAT GGCAGCATTA GCGGGGTGGT GCGCGATAGC
TTCACGCCCA GTTTGCCAAT TGGTGGAGCC TTGGTTTCGA CCGCTGGCTT CAGTACCTTT
ACTGATAGTT CAGGTGCCTA TCAAATTCCG GTGGCCGCGC CGGGCACGGT GGCGCTGAAT
GTTCAGGCCG ATCGTTACAC GCCGCAGCAG CAAAATCTGA ATGTAGTTGC CAATACCACC
ACCACAGGCA ATTTCAATTT GGCAGCGGGT GCGGTTGAAG GGATGATTAG CGATGCCAGC
AGCGGCCTAG GAGTCAAGGA TGCTGTGGTC ACGCTTGACA GTTACACGGT CAAAACCAAT
GCGGCGGGTT ACTACAGTTT ACGTTTGCCC TTGGGCAGCT ACAGCGTTGG GGCTAGCAAA
GTTGGCCTAA TCGCCGAAAC GCAGAATTTG ACCCTTAGCA ATGGCCTGAC CAGCACATTA
AACCTCAGTT TGATTCCATT GCTCAGCTAT ACACCGAGCA GCCTTAGCCA AAGCTTTGAG
TTTGGGGCGG CTCCGATTAG CGATAGTTTG AGCTTGGAAT TAACCAACAA TACGACTCAG
CCAATTAGCT ATAGCCTGCG CGAATTGAGC GATGCTGGCT TTACACCTGC GCGTAACCAA
CAACGGATTT TGGTGGTGCT GCGCACGGGT AGTGATGATG CTAACGCCGT AACCATCCCG
CTGAGCCAAC TAGGCTATGC ATACGACCAG ATTGAATTTG GCGAATTTGC CACAATGAGT
TTGGCCGATA TTCAAGCCTA CGATGCAGTG ATGTATCTTG GGACAACCGA TACTGCTGCC
AACAACCCGC AAGAAGCCAA ATTGGCCGAA TATCTCGATG CTGGCGGGCG GTTGTTGATT
GCTGATAACG ATTTGGGTTT CTTCACGCGC ACAGGCAGTT TTTATCAGCA ATATCTTGAT
GCGAGCTTTG GCAGCGACGA TCCTAACACT GCCAATTACA ACTTGATTGG GCTAGATTTC
ATGGCCGGAA TCAATCCCAT GGTGGTTGAT TTCTCCCCTG ATTATTTCGC GCCTGGCAGT
TCATCACGCG CGATCTTCCG TTATAGCGAT GGCTCGGTTG GTGGCTCGTA TATTGAGCGT
AACGGCTATA AAGCAATTTA TTTGGCAGTT GATTTCCGTA ACTTTGGCAC AGGTGCCTTT
GGTGAGCGCA TTGAACGCGA TATTGTTGAA GTCAGTCTTG CTCAATTGCT GGGAACCACC
GATCAAATTA ATTGGGTGGA GCTTGCGCCA TTGCGCGGCG AAGTGGCAGC AGGACAAAAC
AGTTCGGTGG CAGTTAGCTG GTTCCCTGAT CGATTAACTC AACCTGGAAC TTACACTGGC
ACGGTGGTTT TGGCCCAAAC AGCGGTCTAC ACCCAAACTG CCGAAATTCC AATCAGCATT
ACGATCACGC CCAACACCAG CCAGGCCCGA CTGAGCGGCG TTGTGACTGG CTCGGGCGTT
TGTAGCAATA CCCCAGCGCC ATTAGCTAAT GTATTGGTGA CGATCAATGA TCAACAAGGC
TTGGTTACCA GCGTGCGCAC CAATAGCGCA GGCGAGTATG TGGTGTTTGT GCCAACAGGC
GACGAGTATA GCCTTGAATT TAGCGCAACC GACCACGTTG CCAGCAGCCA AAGTATTACG
GTTGCTGATG GTGAGCCAAG TGTCAATAAT GTGCAATTGC GGCTAGATAA AGGTTGTCTG
ATTGTTGGGC CACATGCGAT CAATACCAAT GTGGTGTTTG GCGAAAGCAA AACCGAGCAA
CTATTTGTGA TCAGCACCGG AGCGCAAGCG CTGGATGTAG CAATTAGTGA GACCCGTGCT
AAAACCGTCA ACAGCGGCGA TCTAACGCTG ACCGAAGTTG ATTACAACTG GATCGAAGCC
AGCGATGGCA CGAATTTGAA TATGGGTGCT TACGATTTGG TCAATATCGT CACGCCATTT
CCGATTAATT TGTATGGCGT GAGCACCACC GATTTGCGCA TCTCGAATAA CGGCGTGATG
ATTCTCAACA ATCTGACTGG CTTGATCGAA ATCTTCAATC CCAGCCTAGA GAACGCCATC
CATAACTACG TAATTGCACC TTACTGGGAC GATTTGGATG ATGAAACAGG CGGCGTGTAT
TGGAAAGTCG TCGGCGAAGC GCCCAATCGC GCGGTGGTGG TGCAATGGGA AAATCGCCCG
CACTACAACT TTTGGGATAA CACCACTTTC CAAGCGGTGC TCTCAGAACA AGGCGATATT
TTGTTCCAAT ACAAGGATGT TGATTTCAAC GAACCATTCT TGGATTTTGG GGCTAGTGCT
ACGATTGGGG TACGCGGCAC ACGCAGCGAG ATTGCCCAAT ATAGCGTTGA TCAGCCAGTG
CTGCGTGATC GCATGGCCTT GTGTATCTCA CAAACCTGCG ATAGTTTGAA TTGGCTGAGC
GTTAGCCCGA ATAAACTTAG CAACTTGACT GGCACACCAT CGAGTTTCCA AACGGTTGAT
CTCGCGATTG ATACCAGCAA CTTTGAGACG GTTGGCGTGT ACACCACCAA TTTGGTGCTA
AACCACACCA CGCCACAGCC GCCAGTAGTT GTTCCTGTAA CCGTCAATGT AACCTTGCCC
GAAGGCTATG GCGTGCTGAA TGGTTTGGTT GAAACAACCT TGGTTTGTGA TGTTAATCCA
ATGCCTTTGG CCAACGTCAA AATTACGATT GACACTGAGC CGCCAACGGT TTTATATACC
AATAATGTGG GCAACTACAG CCGCCCAATT CCGGCAGGCA GCTACAACGT ATTGGTTGAA
GGCTACCCGG GGGCATTTAC CAGTGTCAGC TATCAACTAA CAGTTGAGGC AGGCCAGACC
TATCAGCAAG ATTCGCTGTT ACGTTTGAAA GCGCCATGTT TGGATACCAG CAGTACGCCA
GCGATTACCA CCACAACCGA ATTGAATATG CCAATCACCG CTAGCTTTAG CTTGAGCAAT
ATCGGCGCAG GTGTGCTCGA TTGGCAAATT GAAGAACGTT TGCCACAACA AAAAGCCTTG
GCAGCCAATG CCCAACGAGT AACGACCAGC CAAACAGAGG CTCAACCGCA ACTGGTTCCA
GCCAGCGAAC GTTTGCTTGA TGGCGGCTTC GAGGCCACTA CGATTAGCGA TAATGTGGCG
ACCAACCCCT ATTGGAGCCA AGATTCGCGC AACTTTGCCA GCTTGCTCTG CACTGTGGAG
TGTGATGATA TTATGCCGCA TACTGGTGAT TGGTTTATCT GGATGGGTGG GATTGGGTCA
AACTTTGGCA CTGAAACCAG CTATTTCAGC CAAGACTTCA GCCAAACCAG CTTTAGCGCA
GGTACATTGA GTTTCTGGTT ATCGGTTACT GCGCCGATGG ATCGGCCTGA TGATTACATG
CGAGTGCTGA TCAACAACAA TGAAGTATTT CGGGTGACCA ATGCCGATCG GGCCAATTAC
GGCAGCTACA CCCTTGTAAC TGTGCCGATT AATGAGCAAG TGCTCGGTGG CCGCGAGTTG
CATAGCATTC GCTTTGAAGC CCAAATTGTC CAAGGCGGCA ACACCAACTT CTTTATTGAC
GATCTGAGCC TTGATTTAAT CCAAAGCTGT GCTGGCGATG CAGTGGATTG GTTGCACGTT
GAGCCGACTC GGGGCAGTAT AGCCGCCGAT AGCGAGCAAA CAATTGATGT TGCGTTTGAC
CCAACGGGCT TGGCAGTAGG CACGCATACT GCAAGTTTAT GCTTGATTAC TAACAACCCC
AATCGCCAAA ATGTACGAAT TCCCGTTAGC TTGACCGTTG AGCCAGCGGC GATCCCGAAC
TATCCACTCT ATCTACCTAT GATTATGCGC AACTAA
 
Protein sequence
MARFSLVRQL VLALMLVGLA LPAGQPSAAQ TSANPQTFSK LLIDTGSSAS RFAQTHGKLL 
VDYGAFGLWQ IADNQLNQVK QLAGATTNDL DTLNFRGIRF NPLKAQPSLR LTQSPTAEHQ
LWLVQFIGPI KDTWLEQLTK AGAELVIYVP SNSYLVWADG ASLNKLNQLQ QTSHAIQWMG
VYQPEYRLAP ELRTKASSPK RTELVDVSVQ IYNAGDIQAS VDAVIAASSK LHARPWQVLN
FTTLSVQLPE TELAALAQRA NIYNIEPWSE PELFDERQGQ IIAGNVTTLN GKTVPSGPGY
LSWLQSQGLP NNPAEYPIVD VVDDGFDDGT INPLHPDFYL NGVRPGTSRI TATSNCTLDP
RGNSLAGHGQ INAGIIGGYN NLTGFPYIDE ASSGIAGGYN IGLGIAPYTR MSSTKIFRNS
GSFSISNCDG GDSYADIVTA GYEHQAAITS NSWGQPGSMG AYNIYSQLYD QLTRDASSDD
AGNQAMLHIF AAGNMGEFGA NTVSAPSTAK NVISVGATEN VRDEGVLDGN GCEISQADNA
DDLAVFSGKG PTDDSRIKPD IMAPGTHVIG PAPQESGFLG AGVCGGLTNP YYPDNQTLYT
WSSGTSHSAP AVSGAASLLY TKYRTSFGNG ATPSPAMLKA YLLASSRYLD GVNTGGNLPT
NQQGWGDVYL KTALDSTPRI VVDQSHVFGA SGESFSQVGQ IADSNKALRV ALTWTDAAGG
TTGDAFVNDL DLEVTVGGQV YKGNVFNETL STTGGVADAK NNVESVYLPA GASGAIQVRV
IARNIAGDAI PGNADTTDQD FALYVYNASQ GAIGTVTGRV SNASNAPVAN VRIATSNNLS
TVSDANGNYR LVLPVGTYAL TASINGVFQS VPALTIGQNA QISQNFTLVY GSISGVVRDS
FTPSLPIGGA LVSTAGFSTF TDSSGAYQIP VAAPGTVALN VQADRYTPQQ QNLNVVANTT
TTGNFNLAAG AVEGMISDAS SGLGVKDAVV TLDSYTVKTN AAGYYSLRLP LGSYSVGASK
VGLIAETQNL TLSNGLTSTL NLSLIPLLSY TPSSLSQSFE FGAAPISDSL SLELTNNTTQ
PISYSLRELS DAGFTPARNQ QRILVVLRTG SDDANAVTIP LSQLGYAYDQ IEFGEFATMS
LADIQAYDAV MYLGTTDTAA NNPQEAKLAE YLDAGGRLLI ADNDLGFFTR TGSFYQQYLD
ASFGSDDPNT ANYNLIGLDF MAGINPMVVD FSPDYFAPGS SSRAIFRYSD GSVGGSYIER
NGYKAIYLAV DFRNFGTGAF GERIERDIVE VSLAQLLGTT DQINWVELAP LRGEVAAGQN
SSVAVSWFPD RLTQPGTYTG TVVLAQTAVY TQTAEIPISI TITPNTSQAR LSGVVTGSGV
CSNTPAPLAN VLVTINDQQG LVTSVRTNSA GEYVVFVPTG DEYSLEFSAT DHVASSQSIT
VADGEPSVNN VQLRLDKGCL IVGPHAINTN VVFGESKTEQ LFVISTGAQA LDVAISETRA
KTVNSGDLTL TEVDYNWIEA SDGTNLNMGA YDLVNIVTPF PINLYGVSTT DLRISNNGVM
ILNNLTGLIE IFNPSLENAI HNYVIAPYWD DLDDETGGVY WKVVGEAPNR AVVVQWENRP
HYNFWDNTTF QAVLSEQGDI LFQYKDVDFN EPFLDFGASA TIGVRGTRSE IAQYSVDQPV
LRDRMALCIS QTCDSLNWLS VSPNKLSNLT GTPSSFQTVD LAIDTSNFET VGVYTTNLVL
NHTTPQPPVV VPVTVNVTLP EGYGVLNGLV ETTLVCDVNP MPLANVKITI DTEPPTVLYT
NNVGNYSRPI PAGSYNVLVE GYPGAFTSVS YQLTVEAGQT YQQDSLLRLK APCLDTSSTP
AITTTTELNM PITASFSLSN IGAGVLDWQI EERLPQQKAL AANAQRVTTS QTEAQPQLVP
ASERLLDGGF EATTISDNVA TNPYWSQDSR NFASLLCTVE CDDIMPHTGD WFIWMGGIGS
NFGTETSYFS QDFSQTSFSA GTLSFWLSVT APMDRPDDYM RVLINNNEVF RVTNADRANY
GSYTLVTVPI NEQVLGGREL HSIRFEAQIV QGGNTNFFID DLSLDLIQSC AGDAVDWLHV
EPTRGSIAAD SEQTIDVAFD PTGLAVGTHT ASLCLITNNP NRQNVRIPVS LTVEPAAIPN
YPLYLPMIMR N