Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_0345 |
Symbol | |
ID | 5732255 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 413746 |
End bp | 415827 |
Gene Length | 2082 bp |
Protein Length | 693 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 641277469 |
Product | proprotein convertase P |
Protein accession | YP_001543125 |
Protein GI | 159896878 |
COG category | |
COG ID | |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGTTTAC GGATTGGGTT TATCGGTCTT ATGATCGCCC TGCTCGGATT AACGGGCGGC TTGCGTAGTT TAACCAGCGT TCAAGCAGTA CCCAACGCCC CTCAAAATCT CGCAGCCCAC CAACTTAATA CTCCTCAAGG ATTTGCGCGG ATTAGCGAAC AAGAATCAAA CGATACTCCT GCCATGGCCC AGCCAATCAT TGGCGATTCA GCCGTCATTC GCGGCTATAT CACTGCCAAT GATAGCGATT TCTTGGCAAT TCCACTTACT AGCGTCCAAC GGGTGGTCGC AGCAACTATG ACCTCGGCTT CAGCCACAAA TATTGTCGAT TCGATCCTGA CTCTCTATCA ACCTGATGGG ACAACTGTCT TGGAGCTTGA TAATGACGAT GGGATTTTTG GCGTGAGTTC TTCGGTTATC AGCTCAGCGA TTATCACGAC TACTGCAACC TACTATCTCA AGGTCACCGC CAACAATACG ACGAATAAAA TTTATTATTA CGATTTATAT GTACGGATTT TAACCAGCGA ACCAATTGTC GAACAAGAGC CAAATGAGCC AGCTCAAATG CTGCCAGCAG CTGGGATTGT CAGTGGGATT ATTAGCCAAA CTGGCGATAT TGATCGTTTT CAGCTGGCCT TGAATCCAGG CGATACGTTG TTTACAACCC TCGATATAGA CCCCGAACGT GATGGCATTA GCTGGAATGG GCGGGTAACG ATCGGGCCAT TTGGGCCAAA TATCATCACA ATCAACGATG GCAATGCGGT TTCGCCCAAT GCCGAATCGG CCCAATTAAC CGTTAAAGAA GCTGGTAATT ATATAATCAG CGTCGATAAC CTATCAGCAA TTAATAATTC GACCTACATC CTACAAGTGC TGATCATCCC AGCCGAAGAG CAAGCCAATT GTCAGACCTA TATGAGCACC GATGTCAATA AAACAATTCC ACCTGATCCT GGTATGATCA CGTCGGGATT AAATATTCCT GACAATCTCT TGATTGGCGA TTTAGACTTG ATTATTCAGC TTGATCATAC CTTTATACCT GATCTCGATG CGCAATTGAC TACGCCTGAT GGGAATATCC TTGGGCTATT CAGCGATATT GGAATCACGA CTAGTGCCCC AGCGAGCATG AACCTGATTA TTGATGATGA AGCAGCCTTA CCATCATTTC AAGCGCTTAG CGTCAATGGA TTGATCAACT TGCCTGAGGC AGCCTATCGG ATGAGTTGGT TTGATGGCCA ACGTACCCAA GGCAATTGGA CATTAACGCT CTATGATGAT GCAACTGGCG ATGGGGGCAC ATTACTCAAT TGGGGCTTGC AGGTTTGCGC TGCGCTGCCA CCCGCCGATT GTCCTGATGG CACGGTTAGC AGCACCATCT ACAGCACCGA TTTTGAAGCA AATGATGGGG GCTTTACCCA TAGCGGAACC ACTGATCCAT GGCAATATGG TCAACCAAAC TCAGCGCCAA TTGTGGGCAG TTATAGTGGG AGCAATAGTT GGAAAACCAA TTTAACTGGC AATTATCCCG CTTTAATGAA TGCCAACCTA ACGTCGCCGG CCATTGATCT TAGCGGTGTA GTTGGGCCAA TTCAGGTGCA ATGGCAACAA CGCTATCAAA TCGAATCAGC AGCATTTGAT AATTATGCAG CACAGATTGT TGGTAGTAGC AACCAAGTAC TCTTTCAGCA TCGCGATGGC GTAATGCGTG AATCAGTTGG CAATCCGCTG GTTACGGTTC AAGCAAGCAC TGGTTGGAGC CGCCAACTGC ATGATATTAG TAGCTTTGTG GGTCAGTCGA TTCAACTGCA ATTCCATATG GATACTGATT CCAGCGTTGA ACTGGCGGGT GTCGCCATTG ATGATGTGCT GGTAACTGGT TGTGTTGTGC TCCCTACGGC CACCCCAACC GAAACGCCAA CTAACACACC GACCAATACC CCCACCGAAA CCCCAACCAA TACGCCGACG GTTACGGTTA CGCCAAGTAA TACACCAATC GTTACCCCAA GCGTGACGGC TGGGCCAACG ACGATTCCAG TTTATCTGCC GTTGGTTAGT AAAGGCGAAT AA
|
Protein sequence | MRLRIGFIGL MIALLGLTGG LRSLTSVQAV PNAPQNLAAH QLNTPQGFAR ISEQESNDTP AMAQPIIGDS AVIRGYITAN DSDFLAIPLT SVQRVVAATM TSASATNIVD SILTLYQPDG TTVLELDNDD GIFGVSSSVI SSAIITTTAT YYLKVTANNT TNKIYYYDLY VRILTSEPIV EQEPNEPAQM LPAAGIVSGI ISQTGDIDRF QLALNPGDTL FTTLDIDPER DGISWNGRVT IGPFGPNIIT INDGNAVSPN AESAQLTVKE AGNYIISVDN LSAINNSTYI LQVLIIPAEE QANCQTYMST DVNKTIPPDP GMITSGLNIP DNLLIGDLDL IIQLDHTFIP DLDAQLTTPD GNILGLFSDI GITTSAPASM NLIIDDEAAL PSFQALSVNG LINLPEAAYR MSWFDGQRTQ GNWTLTLYDD ATGDGGTLLN WGLQVCAALP PADCPDGTVS STIYSTDFEA NDGGFTHSGT TDPWQYGQPN SAPIVGSYSG SNSWKTNLTG NYPALMNANL TSPAIDLSGV VGPIQVQWQQ RYQIESAAFD NYAAQIVGSS NQVLFQHRDG VMRESVGNPL VTVQASTGWS RQLHDISSFV GQSIQLQFHM DTDSSVELAG VAIDDVLVTG CVVLPTATPT ETPTNTPTNT PTETPTNTPT VTVTPSNTPI VTPSVTAGPT TIPVYLPLVS KGE
|
| |