Gene Haur_3587 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3587 
Symbol 
ID5735448 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp4514514 
End bp4516313 
Gene Length1800 bp 
Protein Length599 aa 
Translation table11 
GC content53% 
IMG OID641280736 
Productpeptidase S8/S53 subtilisin kexin sedolisin 
Protein accessionYP_001546351 
Protein GI159900104 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1404] Subtilisin-like serine proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATAGAG AACATCGCAG AGGATTTTTT CGCAGTTTAT TGCTGGCAAG CCTCTTTTTG 
AGTGCGATTG GCCTTGCACA GCCTGTTGCA ACCACCCAAC CGCAATCAAT GTTAGTGCAA
GGTGCTGCGC TAAGCAACCA AGCTAGGGCA ATTCAAGCAG TTGGCGGCAC TGTCGAACGC
CAATTTGCCC GCTTTGAAAC CAGCGTAGCC CAATTAACCA AACCACAAAT TGCTCAACTG
CAAGCGCAAG GCTTGCGGGT TTTTGCCGAT CAAGCGGTTG CTAGCTCAGC GCAAGGAGCT
GTTGATCAGC CAATTGCTGT GCAGGCTGGC CCCAAAAGTA GCAACCCTGC GCCATTGCAA
TATCCATCGA TTGTAACCGG CGCTGATGTG GTGCAACGCC GTGGTATCGA TGGACGCGGC
GTGACCGTGG CAGTGATCGA TTCGGGTTTG CCAGCGATCG AACGACCTGA GCGTTGGCAG
CGCATCGATA ACACGACTGC TCGCTACAAC CAAGGCAGCC GTTTTTTGAT TTATAAAGAC
ATGTTGGCAA GCAACCAAAT TACGAATAGC AGCGACCCCT ACGGCCATGG AACGCATGTA
TTTGCCACTT TAGCCGATAA TCGCGCTTTG CCAGCAGGCT ATAGCGGAGC CAAAGTCGGG
ATTGCCCCAG GCGCGAATTA TGTGGTCGTT CGGGCCTTAA ATTCGCAAGG CCAAGCCTCG
TATAGCACGA TTATTGCGGC GATTGATTGG GTGATTGAGC ATGCTGACGA GTACGATATT
CGGGTGCTGA ATCTCTCGTT GCAAGCCGAA GTGATTGCGC CCTACTGGTA TGATCCGCTC
AATCAAGCGG TGATGCATGC TTGGAATGAG GGCATCACGG TGGTGGCAGC GGCTGGCAAC
ACTGGCCCAA ATCCTGCCAC GATTATGGCT CCAGCCAACG TGCCCTATAT CATCAGTGTC
GGGGCAATTC GGCCTGCAAT CTACAACGAC AATGGCAGCG ATAGTTTAGC AGCCTATTCA
TCGGCTGGGC CAACCGAAAG TCGTTTTGTT AAGCCCGATG TGGTGATTGC TGGCTCGCGG
GTCATTGCAC CCCTGCCCGC CGATAGTGTG TTAGCTAATT CTGGGTCCGC AGGTTTGGTT
AGCGAACGCG CCAAACTCGA ATGGGCTGAT TTGAAAACCT CGCAACAGCT TAATTATTAT
GCCTTGAGTG GCACATCGAT GGCCGCCGCC GAAGTCAGTG GAATTGTCGC CTTGTTGTTG
CAAGATGAGC CAAATTTGAC CAATAACCAA ATTAAGGCTC GTTTGACTGG CACGGCCCAA
CTTGCCACTT TGGATGATGG TCAAGCCGCT TACAGCACGT GGCAACAAGG CGCAGGCAAA
GTTGTGCCAA CTGCGGTGAT CGATGGTAGC AACACTGGCG CTGCCAACGC GGGAATGGAT
TTGCCGACCG ACCTGATTGT GAGCAGCAAC CCCGATGATA CCAGCAACCA TTACCAAGGC
TCGACTGAGT ATGATGCGGT GACCAACACT TTTACTGATA ACACCACGAC CTATAGCTCA
ACCCTTGCGA CCAGCTACAG CACCTGGGCG GGGAGCTATA GTACATGGGC TGGCAGTTAT
CTAACATGGG CTGGTAGTTA CAGCACGTGG GCTGGCAGCT ACAGCACGTG GGCTGGCAGT
TACAGCACGT GGGCTGGCAG TTACAGCACG TGGGCTGGCA GTTATAGCAC GTGGGCTGGC
AATAACGCTG CTGCGAGTGG AACCACCAAT GTCAACGCTG GAAGCCTCGT GAGCGACTAA
 
Protein sequence
MDREHRRGFF RSLLLASLFL SAIGLAQPVA TTQPQSMLVQ GAALSNQARA IQAVGGTVER 
QFARFETSVA QLTKPQIAQL QAQGLRVFAD QAVASSAQGA VDQPIAVQAG PKSSNPAPLQ
YPSIVTGADV VQRRGIDGRG VTVAVIDSGL PAIERPERWQ RIDNTTARYN QGSRFLIYKD
MLASNQITNS SDPYGHGTHV FATLADNRAL PAGYSGAKVG IAPGANYVVV RALNSQGQAS
YSTIIAAIDW VIEHADEYDI RVLNLSLQAE VIAPYWYDPL NQAVMHAWNE GITVVAAAGN
TGPNPATIMA PANVPYIISV GAIRPAIYND NGSDSLAAYS SAGPTESRFV KPDVVIAGSR
VIAPLPADSV LANSGSAGLV SERAKLEWAD LKTSQQLNYY ALSGTSMAAA EVSGIVALLL
QDEPNLTNNQ IKARLTGTAQ LATLDDGQAA YSTWQQGAGK VVPTAVIDGS NTGAANAGMD
LPTDLIVSSN PDDTSNHYQG STEYDAVTNT FTDNTTTYSS TLATSYSTWA GSYSTWAGSY
LTWAGSYSTW AGSYSTWAGS YSTWAGSYST WAGSYSTWAG NNAAASGTTN VNAGSLVSD