Gene Haur_3542 details


Gene Information

Locus tag: Haur_3542
Symbol:
ID: 5735401
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 4457652
End bp: 4459568
Gene Length: 1917 bp
Protein Length: 638 aa
Translation table: 11
GC content: 50%
IMG OID: 641280689
Product: peptidase S8/S53 subtilisin kexin sedolisin
Protein accession: YP_001546306
Protein GI: 159900059
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG1404] Subtilisin-like serine proteases
TIGRFAM ID:
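
The start, end, gene length, and protein length fields above are related by simple arithmetic. The following is a minimal sketch of that consistency check, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon (the record does not state either convention explicitly). The function name `cds_lengths` is hypothetical.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1  # inclusive range
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # the last codon is the stop codon
    return gene_len, protein_len


# Coordinates taken from the record above.
gene_len, protein_len = cds_lengths(4457652, 4459568)
print(gene_len, protein_len)  # 1917 638
```

Under these assumptions the result agrees with the listed gene length of 1917 bp and protein length of 638 aa.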


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0100576
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAACGAA CCGTTCGCAT GGGCTTGCTA CTGTTGTTGG CTTTTGGATT AATGCCCTTG 
ATTCAACGCC CAGCGCAGGC AGCGACCCTC GATGATCAGA TTATTGCCGA TGAAATTCTG
GTAACGTTTA ATGATGGCTA TCGTGTCAGT GCGAATGGGG CATTCTTTGA AGGTTCTAAG
GCATTGAAAA ATGGCCTAGG CATTGGGCCA TTACTTTCGA CCCAAATGCT TGATGGGGCT
GGTCAAGTGG CAATGTTGAA GCTCCCCGCA GGCGCAAATG GCGATGCTAA AATTGCCCAA
TTGCTCAAAA ATCCAGCAGT GCGCTACGCC GAATACAACG CTGTGCGTGA GATTCTGGTC
GATCCCAATG ATGAATATTA CTCAGAACAA TGGGGCTTGC CCAAAATTGG CGCTAATGCG
GCTTGGGACA TGACCCAAGG CAACGGCTTG GTGATTGCCA TTATTGATAC TGGGGTTTCA
CCAACCCACC CTGATTTGGC TGGCCATGTG TTGCCTGGCT ACAATGCCTT GCAAAATAAT
AGCAATAGCC AAGATGATCA AGGCCACGGT ACGGCCATGG CTGGGATTGC TGCTGCCCTA
ACCAATAATG GTCAAGGTGT AGCTGGGGTT TGTTGGAACT GCCAAATCCT GCCAGTTAAA
GTGCTGAACA GTCGTGGTCA AGGCACAGCG GCAGATATTG TCGAAGGCAT GTATTGGGCT
GCTGATAATG GCGCACGCAT CATTAGTATG AGTTTGGGTG GCCCACGGGG TACGCAAGCC
GAGCAAGATG CAGTTAATTA TATTTATAGC AAAAATATTC CACTCTTTGC CTCATCTGGC
AATTCGGGCG ATGAGGGCAA CCCACGCATG TATCCAGCCG CTTTTGATCA TGTAATTGCG
GTTGGGGCAA CCACGACCCA AGATCGGGTT GCAAGTTTCT CATCGTATGG CGATTATGTG
GATATTGCTG CTCCAGGCGT AAATATTGTG ACGACTGGAT GGGATGGTGG CGATACCTAT
GAAATGGGCA GTGGCACATC GCCAGCTTGT CCGTTTGTGG CAGCAACTGC GGCCTTGGCG
CTGAGTGTTT GGCCTGAGCT TACGGTCGAT CAAATTGAAA AATTGATCAC TGGCAGTGCT
GTTGATATTT TGACTCCTGG TAAAGATGTC TATAGTGGGT TTGGCCGACT GGATACCTAC
AAAACGGTGC AAAATGCGGT TTTGCGCACG ATTCCTGGCG AACCACAACC CCAACCACCA
GCACCACCAG CGCCGCAACC ACCAACTCCA GAACCACAAC CAGGTAACCC TGCGTTTGTG
CCTGTGGGAG CGCCCCCATT GCCAGCACCG GTCGGCGAAG TCTACTTCCC TGAAACTGGC
CATAACTTGC GTGGCGAGTT CAAAAACTAC TGGGATCGTA ATGGTGGCTT GGCAGTCTTT
GGCTTCCCAA TTAGCGAAGA ATTTACCGAA CAAACTCCTG AAGGTTCGTT TACGGTGCAA
TACTTCGAAC GCCAACGCTT TGAATTTCAC CCTGAAAAAG CTGCACCCTA CAACGTGTTG
CTGGGTCGCT TAGGCGATGC TGTGCTGCGG GATCGTGGCG ACGATTGGGC CAACTTCCCC
AAAACTGGGC CAGAAAATGG CTGTCTCTAT TTCGATCAAA CCCAGCACAA AATTTGTGGC
GAGTTCCGCA AATATTGGGA AACCAATGGG CTGAATGATC CTGCTTTGAA CAAATATGAT
CGCAGCTTGC AATTGTTTGG CTTGCCATTA TCCGAGCCAA TGACCGAAAC CAATCGGGAT
GGGGCAACCG TCACGACCCA ATGGTTTGAG CGCGGCCGCT TTGAGTATCA CGAAGGTCAA
GGCGTGCTGT TAGGTTTGTT GGCCAAAGAA TATGCCAACA ATCGCAGTTG GCGCTAA
 
Protein sequence
MQRTVRMGLL LLLAFGLMPL IQRPAQAATL DDQIIADEIL VTFNDGYRVS ANGAFFEGSK 
ALKNGLGIGP LLSTQMLDGA GQVAMLKLPA GANGDAKIAQ LLKNPAVRYA EYNAVREILV
DPNDEYYSEQ WGLPKIGANA AWDMTQGNGL VIAIIDTGVS PTHPDLAGHV LPGYNALQNN
SNSQDDQGHG TAMAGIAAAL TNNGQGVAGV CWNCQILPVK VLNSRGQGTA ADIVEGMYWA
ADNGARIISM SLGGPRGTQA EQDAVNYIYS KNIPLFASSG NSGDEGNPRM YPAAFDHVIA
VGATTTQDRV ASFSSYGDYV DIAAPGVNIV TTGWDGGDTY EMGSGTSPAC PFVAATAALA
LSVWPELTVD QIEKLITGSA VDILTPGKDV YSGFGRLDTY KTVQNAVLRT IPGEPQPQPP
APPAPQPPTP EPQPGNPAFV PVGAPPLPAP VGEVYFPETG HNLRGEFKNY WDRNGGLAVF
GFPISEEFTE QTPEGSFTVQ YFERQRFEFH PEKAAPYNVL LGRLGDAVLR DRGDDWANFP
KTGPENGCLY FDQTQHKICG EFRKYWETNG LNDPALNKYD RSLQLFGLPL SEPMTETNRD
GATVTTQWFE RGRFEYHEGQ GVLLGLLAKE YANNRSWR
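
The gene and protein sequences above can be cross-checked by translating the nucleotide sequence. The sketch below is one way to do that without external libraries. It assumes the sense-codon assignments of the standard genetic code, which translation table 11 shares (table 11 differs in the start codons it permits, which does not matter here because this gene starts with ATG). The helper names `clean`, `translate`, and `gc_content` are hypothetical, and the code raises `ValueError` on any base other than A, C, G, or T.

```python
# Codon table in NCBI order: first, second, third base each cycle through T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the 10-base block layout."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a nucleotide sequence, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        index = (16 * BASES.index(codon[0])
                 + 4 * BASES.index(codon[1])
                 + BASES.index(codon[2]))
        amino_acid = AMINO_ACIDS[index]
        if amino_acid == "*":  # stop codon
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Return the fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First row of the gene sequence from the record above.
first_row = "ATGCAACGAA CCGTTCGCAT GGGCTTGCTA CTGTTGTTGG CTTTTGGATT AATGCCCTTG"
print(translate(first_row))  # MQRTVRMGLLLLLAFGLMPL
```

The translated first row matches the first 20 residues of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_content` would allow the same comparison against the complete protein sequence and the reported GC content.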