Gene Information |
Locus tag | Haur_0343 |
Symbol | |
ID | 5732253 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 409107 |
End bp | 411299 |
Gene Length | 2193 bp |
Protein Length | 730 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 641277467 |
Product | proprotein convertase P |
Protein accession | YP_001543123 |
Protein GI | 159896876 |
COG category | [R] General function prediction only |
COG ID | [COG3889] Predicted solute binding protein |
TIGRFAM ID | |
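The coordinates, gene length, and protein length listed above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function name `check_cds_record` is hypothetical, and it assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length.

```python
def check_cds_record(start_bp, end_bp, gene_length_bp, protein_length_aa):
    """Return True if the coordinates and lengths of an unspliced CDS agree.

    Assumptions (not stated in the record itself): coordinates are 1-based
    and inclusive, and exactly one terminal stop codon is excluded from the
    protein length.
    """
    span = end_bp - start_bp + 1                   # inclusive span on the replicon
    codons, remainder = divmod(gene_length_bp, 3)  # whole codons, leftover bases
    return (
        span == gene_length_bp
        and remainder == 0
        and codons - 1 == protein_length_aa        # drop the stop codon
    )


# Values taken from the Haur_0343 record above.
print(check_cds_record(409107, 411299, 2193, 730))  # prints True
```

Here 411299 - 409107 + 1 = 2193 bp, and 2193 / 3 = 731 codons, which leaves 730 residues once the stop codon is removed.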
| |
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.0000557566 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGATTAC GGATTGGTTT AATCATCACG CTTTTGTTGA TGATCACCGT TGGAAAAACC CTGCAAAACC CCAGGGATGT AGCCCAAGCC CTGTCCCAAA CCCCTGATCA AACCACAATT CCGCAAGGAT TTTCCCGCAT CCAAGAGCAA GAATTTAACG ACTTACCACT CAATGCCAAT CTGTTGGTTG GCAGCAGTTT GGTGGTGCGC GGCTCAATTC AGCAAACTGA TGTTGATTAT TTTGCGCTTG ATTTAACCGC TGGTCAGCGT TTAGCAATGG CCACCATTAC TAGCGCCAGC GTCGCCTCCA GCGATACAAC CATCCATCTC TATGCTTTCG ATGGCGTAAA TAGTGATCTG ATCGAAACCG ATTTGAGCGA TGGCATTCTG AGCAACAGTT CCTCGGTGAT CAGTTCTCAG CCAATTACGC TGACCGGAAC CTACCTGATT AAAGTTGAAG GTGGCACTGC AACCACGGTT ATTCAGCCCT ACGACTTATA TGTGCGAGTG CTGAGCGAAA CTGCCAGCGA ACAAGAACCT AACGACGAAG TTGCCCAAAC CATCGATGCT CAAGCGGCAA TCAGCGGGGT AGTTTCAACC ACCAACGATC TTGATCGCTA TCAATTTAAT GTTAATCCTG GCGACACAAT TTTTGCCACC GTTGATTTTG ATCCAGAGCG TGATGGCATA ACTTGGAATG GCTTTCTCGA TATTGGCATG ATCAACAATA CCTATCTTCG GGCTAACGAT AGTAATAGTG TTTCACCCAA CGCTGAAGCC AATGTGATCA CCGTTCAACA AGCTGGCACT TACGAAATTC GCATTGGCTC ACTACTTGAG ATCGGTGTGG ATGCTAGTTA TTTGGCGCAA GTTACGATTA TTCCCGCCGC TATTCAGGCC AACTGCCAAA CCGTGATGAG TTCAGGTGCA CCACAGAATA TTGGCCCACA AGCTGGAATA ATTCAATCGA CATTAACCGT CACCCAAGCA GCCAACATTG CCGATATTGA TGTATTGCTC AATTTAGAGC ATAGCTTCAT GCCCGATTTG GATGTAACCC TGACTGCCCC CGATGGTAAT GTGATTAACC TCTTCACCGA TATTGGCAAT GTGCAGCAAC CTACCGTTAA TCTCGTGATT GATCAACAAG CTGCCTTACC ACTTGGCACG TATAACGTTT TGAGTGGCAC GCATTTTGGT CCGAAATGGA ATAGCAGCCT CGATTGGTTG GCTGGACAAC AAGCCCAAGG CCAATGGATA CTGACGATTT ATGATGATAC CGACCAAAAT GCTGGTGTAT TAAATGGTTG GGGCTTGCGG ATTTGTGGCA TGCCCACGCC AAGCGATTGT CCAGTCGGGA TGTCGCGCAG CGTGCTTTAT AGCAGCCAAT TTGAGGCCGA TAATGGCGGG CTTACTCCAG GCCCATTTGA CCAAGAATGG GTTTGGGGTA ATCGTAATAG CCCACCAATT GTTGGTGCGT ATAGCGGCGA AAATAGCTGG AATACCAATT TAACCGGAAA TTATCCTAAT AGTACCCGCA TGCAATTGCT ATCACCGCAA ATTGACTTAA CCAATGTTAC GGGGCCAATT TATGCAAGTT GGTATCAACG CTATCAGCTT GATAATAGTG TTAACGATTT TTATCAGGTA ACGGCCCATA AGCCCCAAGT TGAACAGATT CTCTTTCGCC ATCAAAGTGC TGCAATGCAA ATTAACCTCG GAAATCCTTT GGTTACGCTT GATCAAAGTA CAGGTTGGGG ACTTCAACGC CATGATCTAA GCGATTTTGC TGGCACTTCA 
CTCTATTTAA CCTGGGATTT CGGTAGTGAT GAGGTAGCAA GTTTTGCTGG GATTGCGCTT GATGATGTGG AAATTACTGG TTGTATTGAT CCAGCTCAAA TCACGCCAAC GAATACGCCA ACGCCAAGCA ACACGCCAAC CCTAACATCA ACTCCTAGCA ATACGCCAAC GCCAAGCAAT ACGCCAACGG CGACCGCAAC ACCAACCCAA ACCTTAACGC CAACTGAAAC GCCAACGCCA AGCAATACGC CAACGGCGAC CGCAACGCCG ACCCAGACCG AAACGCCAAC CGCGACTGAA ACACCAAGTA TCACCCCAAC GAGTACACCA AGTGTAACGG CAGATCCAAG CTTAATCCCG GTCTATCTGC CTTTAGTCAG TAAAGATAAT TAA
|
Protein sequence | MRLRIGLIIT LLLMITVGKT LQNPRDVAQA LSQTPDQTTI PQGFSRIQEQ EFNDLPLNAN LLVGSSLVVR GSIQQTDVDY FALDLTAGQR LAMATITSAS VASSDTTIHL YAFDGVNSDL IETDLSDGIL SNSSSVISSQ PITLTGTYLI KVEGGTATTV IQPYDLYVRV LSETASEQEP NDEVAQTIDA QAAISGVVST TNDLDRYQFN VNPGDTIFAT VDFDPERDGI TWNGFLDIGM INNTYLRAND SNSVSPNAEA NVITVQQAGT YEIRIGSLLE IGVDASYLAQ VTIIPAAIQA NCQTVMSSGA PQNIGPQAGI IQSTLTVTQA ANIADIDVLL NLEHSFMPDL DVTLTAPDGN VINLFTDIGN VQQPTVNLVI DQQAALPLGT YNVLSGTHFG PKWNSSLDWL AGQQAQGQWI LTIYDDTDQN AGVLNGWGLR ICGMPTPSDC PVGMSRSVLY SSQFEADNGG LTPGPFDQEW VWGNRNSPPI VGAYSGENSW NTNLTGNYPN STRMQLLSPQ IDLTNVTGPI YASWYQRYQL DNSVNDFYQV TAHKPQVEQI LFRHQSAAMQ INLGNPLVTL DQSTGWGLQR HDLSDFAGTS LYLTWDFGSD EVASFAGIAL DDVEITGCID PAQITPTNTP TPSNTPTLTS TPSNTPTPSN TPTATATPTQ TLTPTETPTP SNTPTATATP TQTETPTATE TPSITPTSTP SVTADPSLIP VYLPLVSKDN
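The sequences above are displayed in space-separated blocks of 10 characters, so they need to be normalized before any downstream use. The following is a minimal sketch under stated assumptions: the helper names `normalize_sequence` and `looks_like_complete_cds` are hypothetical, and `START_CODONS` lists only a common subset of the initiation codons permitted by translation table 11.

```python
# Common bacterial start codons; translation table 11 also permits rarer ones,
# so this set is an illustrative subset, not the full list.
START_CODONS = {"ATG", "GTG", "TTG"}
STOP_CODONS = {"TAA", "TAG", "TGA"}


def normalize_sequence(blocked):
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(blocked.split()).upper()


def looks_like_complete_cds(seq):
    """Check that a nucleotide string has whole codons, a start, and a stop."""
    return (
        len(seq) % 3 == 0
        and seq[:3] in START_CODONS
        and seq[-3:] in STOP_CODONS
    )


# Toy input in the same blocked layout (not the real gene sequence).
toy = normalize_sequence("ATGAAACCCG GGTTTTAA")
print(toy, len(toy), looks_like_complete_cds(toy))
```

Applied to the gene sequence in this record, which starts with ATG and ends with TAA, the normalized string would be expected to have the listed gene length of 2193 bp.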