Gene Haur_5185 details


Gene Information

Locus tag: Haur_5185
Symbol:
ID: 5737143
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009973
Strand:
Start bp: 267448
End bp: 269265
Gene Length: 1818 bp
Protein Length: 605 aa
Translation table: 11
GC content: 54%
IMG OID: 641282349
Product: peptidase S8 and S53 subtilisin kexin sedolisin
Protein accession: YP_001547940
Protein GI: 159901694
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG1404] Subtilisin-like serine proteases
TIGRFAM ID:
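
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record. It assumes the start and end coordinates are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 267448
end_bp = 269265
gene_length_bp = 1818
protein_length_aa = 605

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated,
# so the protein has one residue fewer than the number of codons.
codons, remainder = divmod(span, 3)
expected_protein_length = codons - 1

print(span, remainder, expected_protein_length)  # 1818 0 605
```

Under these assumptions the span matches the listed gene length, the length is a multiple of three, and the expected protein length matches the listed 605 aa.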


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.193898
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACATTC GTTTCTTGGG AGCATTGGGC TTGCTCATGA TTGTGTTAGC ACTGCCCCGC 
AATAGCGTAG CGGAGCACCA GCGTTCATCG GCATCGCACG ATAGTGCGAT CACGATCAGC
ACCACCGCGC CAGCAGTTCC TGGCCAGTTT ATTGTCCAAT TTCGCCCGAC TGCTTCACGG
AACCAGCGTC AATCGACGCT AGCCGTACTG GGTGCATACG TCGTTCGCCG CATTGAGCCG
CTTAACCTCG ATGTGTTGGA AATACCAGCA CTGACGAAGA CTTCAACATC AACGGCTATG
GATCGAATCC TAGCGCATGC CCGTGACAAT CACGATATTG TATTTTTAGA GCCGAACTAT
ATCTACACCA CAACCTATCT CCCGAACGAC TCCGCCTATC AAAACCAATG GGCTTGGAGT
ATGGTGCGTG CACCACAAGC CTGGGATATA ACCAGAGGGA GTCCCAATGT CACTATTGCG
GTGCTTGACA CCGGGATCGA TCGGGAACAC CCTGATCTCA AAGCGAAAGT TGTGAGCCAT
GGTATCGATT TGGTGACTGC TGATGGGATC GCTCATGATG AGAATGGCCA TGGCACACAT
GTCGCAGGCA CGGCGGCAGC GGCAACGAAC AATGCGCTAG GTGGCGCTGG GATGTGTCCG
CTTTGCCAGG TACTACCAGT GCGCGTCTTG AATGCGTTTG GTTCAGGTAC ACTCGACGCT
GTTGCCGAAG GGATCATCTA TGCTGCCGAC GCGGGTGTCC AAGTTATCAA CCTCAGTCTG
GGCGGGCCGG GTTCTTTTGC GCTGCAACAC GCTGTCGATT ATGCGTGGAC TCATGGATCA
TTTCTCGCAT GTGCCGCTGG CAATAATGGC ACTGCATCGC TCGATATGGC GTATCCTGCT
GCATATGGTA ACTGCTTTGC CGTAGCGGCG ACCACGTCCA TGGATCAAAC GGCTGACTTT
TCAAACTATG GCTCTTGGGT GGAAATGGGG GCACCAGGAG TTAATATCTA CTCTGGTTGG
CTATATGGTG GCTATCATAC CATCAGTGGC ACGTCGATGG CAGCGCCCCA TGTCGCTGGA
GTGGCGGGGT TGCTGGCTTC ATTCGGTTTA AGTAATAGTC AGATCCGCAG CCGACTCTGT
ACATCCGCCG ACCGTACCGC AGCCACAGGG AGCGCCTATA CCTGTGGCCG TCTGAATGCG
TGGCGGGCAG TTTCTGCCGC AATTCCGACG ACAACACCCA CATTGAGACC AACCACGACA
AGGACTCCGA CACCAAGCGC CATTCCACAA CCTGGGGTGA CGGTAATGCC TAGTACGCCG
CCTGCCCATG CCCCGACTGT AACAGCAGTG CCAACGGTGG CTGTTCCTGG CAATACGTTG
ATTAACAGCG GATTTGAGGA TGGCAATGCC CCGTGGGAGC AGGTCTCGGC TTGGAATTAT
GCGTTGATTA TTGATGGTCC TGCTCACACG GGACGACGCG GCGCACTCTT GTCGGGATTT
ATTAACGGCG GAGATGCACT CTTCCAAACC GTTACGGTAC CTGCACATGG GAAACTAACC
TACTATCTGC GGGTGACCTC CGATGATGCC CCAACCCAAC CTCGCGATTA TCTGCGAGTT
AAGTTGTATA CGACCAACGG CATACCGATC GCGACCTTGC GCACGTGGAG CAACGCGACA
CTGTGGAATA TCTGGATCGT TGATTCTGTG GACATGAACG CGTATGCGCA TCAGACCCTG
CGTATTCAGT TTGAAACGAT CACTGATAGT CAATGGAACA CGTGGTTTGC GATTGATGAT
ATCAGTTTGA CCCCTTAA
 
Protein sequence
MNIRFLGALG LLMIVLALPR NSVAEHQRSS ASHDSAITIS TTAPAVPGQF IVQFRPTASR 
NQRQSTLAVL GAYVVRRIEP LNLDVLEIPA LTKTSTSTAM DRILAHARDN HDIVFLEPNY
IYTTTYLPND SAYQNQWAWS MVRAPQAWDI TRGSPNVTIA VLDTGIDREH PDLKAKVVSH
GIDLVTADGI AHDENGHGTH VAGTAAAATN NALGGAGMCP LCQVLPVRVL NAFGSGTLDA
VAEGIIYAAD AGVQVINLSL GGPGSFALQH AVDYAWTHGS FLACAAGNNG TASLDMAYPA
AYGNCFAVAA TTSMDQTADF SNYGSWVEMG APGVNIYSGW LYGGYHTISG TSMAAPHVAG
VAGLLASFGL SNSQIRSRLC TSADRTAATG SAYTCGRLNA WRAVSAAIPT TTPTLRPTTT
RTPTPSAIPQ PGVTVMPSTP PAHAPTVTAV PTVAVPGNTL INSGFEDGNA PWEQVSAWNY
ALIIDGPAHT GRRGALLSGF INGGDALFQT VTVPAHGKLT YYLRVTSDDA PTQPRDYLRV
KLYTTNGIPI ATLRTWSNAT LWNIWIVDSV DMNAYAHQTL RIQFETITDS QWNTWFAIDD
ISLTP
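
The sequences above are laid out in space-separated blocks of ten characters, so they need to be joined before any analysis. The sketch below shows one way to do that and to compute a GC percentage comparable to the GC content field. The helper names `clean_sequence` and `gc_percent` are hypothetical and not part of any database tool, and only the first line of the gene sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence above, used as an illustration only.
first_line = "ATGAACATTC GTTTCTTGGG AGCATTGGGC TTGCTCATGA TTGTGTTAGC ACTGCCCCGC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

To check the listed GC content, pass the full gene sequence through `clean_sequence` before calling `gc_percent`.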