Gene Haur_2203 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_2203 
Symbol 
ID5734090 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp2799137 
End bp2800336 
Gene Length1200 bp 
Protein Length399 aa 
Translation table11 
GC content52% 
IMG OID641279344 
Productpeptidase S8/S53 subtilisin kexin sedolisin 
Protein accessionYP_001544971 
Protein GI159898724 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1404] Subtilisin-like serine proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0116357 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTGATCT TGGCGGTTAG TGCGTTTGCT GGCTCAAATT CATCATTCGC CCAGTCACGC 
GATGTAGAAA AACCCGTCGA TATTGGCGAT GCCGTTGGTC TCGACCTCAA TGCCCCTGCT
GTTCCTGGTC AGTTTGTCAT CAAATTCAAG AATTCAACCT CAAAAGCTAG CCGCGCCAAT
AGCTTGAGTG CCTTGGGCGC AGTGCAAATC GATCGGATCG AAGCGCTTGA CGCTGAAGTC
GTCGAATTCG CTAGCTTGAA GAGCAACGAT AGTTTGGCAA TGCGCCAAGC CATGGTTGAA
AGCTTGCTCA AAGATGGCAA CATCGAATAT GCCGAACCCA ACTTTATCTA TACTTCAACC
TACACTCCCA ACGACCCAGG TCGTAGCTCA CAATGGGCAT GGGGTGTAAC CCAAGCATAC
ACTGGTTGGG ATATCACGCG CGGTAGCAGC AGCGTTGTCG TTGCGGTTGT TGACACTGGG
ATTCAAAGCA CTCACCCTGA TTTGGATGCC AAAATTGTCG CTGGCTACGA CTACATCGAT
AATGACTCAA CGCCAAATGA TGGAAATGGC CACGGTACGC ACGTCGCTGG GACGGTTGCT
GCTGAAACCA ACAATAGCAC TGGTGGCGCA GGAACCTGCC CCAACTGTCG CTTGATGGGC
GTTCGCGTCT TGAATAACAG CGGTAGCGGT ACCTTGGCTG GTGTGGCCAA TGGCATCACC
TACGCTGCTA ACAATGGCGC AAAGGTCATC AACTTAAGCC TTGGTGGCGG TGGTTCAACG
GCCTTGCAAA ATGCCGTCAA CTACGCTTGG GGCCGTGGAG TATTCTTGGC TTGTGCCGCT
GGTAACAGCA ACACCTCAAG CACCACCAGC GCTTACCCAG CTGCGTATAC CAACTGTTTT
GCGGTTGCAT CAACGACTTC AACCGATGCC CGCTCATCAT TCTCAAACTA TGGTACATGG
GTCGAAGTGG CTGCCCCTGG TTCGAGCATC TACTCAACCT GGATTAACAG TGGCTACAAC
ACGATCAATG GTACCTCAAT GGCTACCCCA CACGTTGCCG GTTTGGCTGG CTTGTTGTCA
TCACAAGGCT TGACCAACAG CCAAATCAAG AGCAAAATCT GCTCAAGCTC CGACCAAATT
AGCGGGACTG GCACGCGCTG GACTTGCGGT CGGATCAACA TCTACAAAGC TGTTCAATAG
 
Protein sequence
MLILAVSAFA GSNSSFAQSR DVEKPVDIGD AVGLDLNAPA VPGQFVIKFK NSTSKASRAN 
SLSALGAVQI DRIEALDAEV VEFASLKSND SLAMRQAMVE SLLKDGNIEY AEPNFIYTST
YTPNDPGRSS QWAWGVTQAY TGWDITRGSS SVVVAVVDTG IQSTHPDLDA KIVAGYDYID
NDSTPNDGNG HGTHVAGTVA AETNNSTGGA GTCPNCRLMG VRVLNNSGSG TLAGVANGIT
YAANNGAKVI NLSLGGGGST ALQNAVNYAW GRGVFLACAA GNSNTSSTTS AYPAAYTNCF
AVASTTSTDA RSSFSNYGTW VEVAAPGSSI YSTWINSGYN TINGTSMATP HVAGLAGLLS
SQGLTNSQIK SKICSSSDQI SGTGTRWTCG RINIYKAVQ