Gene Haur_0516 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_0516 
Symbol 
ID5732432 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp598973 
End bp600397 
Gene Length1425 bp 
Protein Length474 aa 
Translation table11 
GC content54% 
IMG OID641277643 
Productpeptidase S1 and S6 chymotrypsin/Hap 
Protein accessionYP_001543293 
Protein GI159897046 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG5640] Secreted trypsin-like serine protease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAACGAT CTGGACGCTG GACAATTCGT CGGGTACGGT CTGCGGTCAT TGCAACGGTG 
TTGAGCACAT CAGTATTGTT AGGCGGCTAC GCTGCCTCAG CCAAAGACAA CAAAAAAGTC
GAAGTTTATC CGCTCCCTGT TGTTGACGAA AAGCAACCTG GTTCCGAACA ATTGCCCCCA
CCAGATAAAA TTGTCGGTGG CTCGGCGGCT ACTGCTGGTG AATTCCCCTG GCAAGCTCGG
ATAGCTCGTA ACGGCAGCCT ACATTGTGGT GGCTCGTTGA TTGCTCCCCA ATGGGTTTTG
ACTGCTGCGC ACTGTGTTCA AGGCTTCTCG GTATCATCAC TCAGCGTGGT GATGGGCGAC
CATAACTGGA CGACCAACGA AGGCACCGAA CAAAGCCGCA CAATTGCTCA AGCAGTTGTT
CACCCAAGCT ACAATTCATC AACCTACGAC AACGACATTG CTTTGTTGAA ACTCAGCAGC
GCTGTAACCC TCAACAGCCG CGTTGCCGTG ATTCCGTTCG CCACCAGCGC TGATAGCGCC
TTGTACAACG CTGGCGTTGT TTCAACCGTC ACTGGTTGGG GCGCGTTGAC CGAAGGTGGT
TCATCACCAA ACGTCTTGTA CAAAGTGCAA GTGCCTGTGG TTTCAACCGC TACCTGTAAC
GCCTCAAACG CCTACAACGG CCAAATCACT GGCAACATGG TGTGTGCTGG CTACGCTGCT
GGCGGCAAAG ACTCATGCCA AGGCGATAGC GGTGGTCCAT TCGTCGCTCA AAGCAGCGGC
TCATGGAAAC TCAGCGGTGT TGTGAGCTGG GGCGATGGTT GTGCCCGCGC CAATAAGTAT
GGCGTGTACA CCAAAGTTTC CAACTACACC AGCTGGATCA ACAGCTATGT CGGTACGGTA
ACCCCAACCA GCACGCCAGT GCCAGGTACT CCAGTGCCAA CCAGCACGCC AGTACCAGGT
ACTCCAGTGC CAACCAGCAC GCCAGTGCCA GGTGGTAGCT TGCAAAATGG TGGCTTCGAA
AGCAGCGCTA GCTGGGTTCA ATCACCAAGC AATATCATCT CAACCACTCG CCCACGCAGC
GGCTCGTATA GCGCCTTCTT GGGTGGCTAC AACAGCGGCA CCGATAACAT CTATCAAAGC
GTGACGGTTC CATCAAATGG TGTGTTGCGC TACTACTGGT ACATGAGCAC CCAAGAAAGT
GGCAGCACTG TCTACGACCG CTTGTATGTT CGCCTCTACA ACAGCAGCGG CAGCTTGATC
ACCACCTTGC GCACCTGGAG CAACGCGAGC ACCAAGAACA CTTGGACGCT TGACACGATT
AGCCTCTCAG CCTACGCTGG CCAAACCGTG CGTGTCCAAT TCGTTGGCAC TACCGATAGC
AGCTTGACCA CCTCGTTCTT CGTGGACGAT GTAACTCTGC AATAA
 
Protein sequence
MERSGRWTIR RVRSAVIATV LSTSVLLGGY AASAKDNKKV EVYPLPVVDE KQPGSEQLPP 
PDKIVGGSAA TAGEFPWQAR IARNGSLHCG GSLIAPQWVL TAAHCVQGFS VSSLSVVMGD
HNWTTNEGTE QSRTIAQAVV HPSYNSSTYD NDIALLKLSS AVTLNSRVAV IPFATSADSA
LYNAGVVSTV TGWGALTEGG SSPNVLYKVQ VPVVSTATCN ASNAYNGQIT GNMVCAGYAA
GGKDSCQGDS GGPFVAQSSG SWKLSGVVSW GDGCARANKY GVYTKVSNYT SWINSYVGTV
TPTSTPVPGT PVPTSTPVPG TPVPTSTPVP GGSLQNGGFE SSASWVQSPS NIISTTRPRS
GSYSAFLGGY NSGTDNIYQS VTVPSNGVLR YYWYMSTQES GSTVYDRLYV RLYNSSGSLI
TTLRTWSNAS TKNTWTLDTI SLSAYAGQTV RVQFVGTTDS SLTTSFFVDD VTLQ