Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_2836 |
Symbol | |
ID | 5734717 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 3604201 |
End bp | 3605298 |
Gene Length | 1098 bp |
Protein Length | 365 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641279979 |
Product | putative membrane-associated zinc metalloprotease |
Protein accession | YP_001545602 |
Protein GI | 159899355 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0750] Predicted membrane-associated Zn-dependent proteases 1 |
TIGRFAM ID | [TIGR00054] RIP metalloprotease RseP |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.000000950908 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATTTTA GCAGTTTGGC GTGGCTTGCG GTTATTCCCG CTTTAGGCTT TTTAGTTGTT GTTCATGAGT TGGGCCACTA TTGGGTTGGC CGCAAAATGG GCATTAAGAT CGAAGAATTT GGCATTGGGT TACCGCCACG CGCCAAAGTG TTGTTCGTGC GCAAAGGTAT TCCGTTCACG CTGAATTGGC TGCCCCTCGG CGGCTTTGTG CGGTTTGCGG GGGAAGAAGG TGGCTTCGAT GACCCCGATA GCCTGGCTTC GGCCTCGCCA CGTCGCCGCA TTCCAGTGAT GGCAGCGGGT GTAATTGCCA ACGTGATTAC CGCAATCATT ATGTTTGCGA TCATTTTTGC AATTTGGGGC TACCCCAACC TTGATAAAGT GATGGTCGCT TCAACCGATG AATTTGCCGC AAACGCTGGG TTTCAAGTCG AAGATGTGTT TGTTTCAATT AATGGAACAG CGATTAGTAC CGATGAACAG GTGCGGCTGT TGGTTGAAAC CAGTGGTGGC GAGCCATTAG ATGTCATCGT CCAACGGGCT GGCGCTGAAC AAAGCCTGAA GGTCACGCCC CAATATAGCG AAGAAGCGCA ACGCTATCGC TTTGGGGTTG GCTTAGGCAA TCCACGCGAA TCAGTTAATA TTTTTCAAGC GATCATCAAC GGGTTTACCT ATAGTTTTCG GCTATTAGGC GAAATGTTTA TGGGCTTTGC GATGTTGATC GGCGGTTTGC TGGGCACGAA TGCAGCGCCG GAAGGTGGCT TAGCTGGCCC AGTTGGCATC GCTCGCTTGA CGGGCCAAGT TGCTCGCTCG GGCTTGCGCG ATTATCTTAA TTTTACCGCC TTGCTCAGCC TGAATTTGGC CTTGATCAAC ATTTTGCCAA TTCCAGCACT TGATGGTAGC CGGATTATTT TTGCCTTGAT CGAAGCGATT CGCCGCAAGA AGATTCCACC TGAACGCGAA GCAGTTGTGC ATGCGGTTGG GATGATGATG TTGCTTGGGT TGATGCTGCT AATTACCGTT TCCGACGTGC GCAATATCAT TAGTGGCGAG CCAGCGATTA CCATGCCGCC AACCCCAACG CCAATCGTTA GACCATAA
|
Protein sequence | MDFSSLAWLA VIPALGFLVV VHELGHYWVG RKMGIKIEEF GIGLPPRAKV LFVRKGIPFT LNWLPLGGFV RFAGEEGGFD DPDSLASASP RRRIPVMAAG VIANVITAII MFAIIFAIWG YPNLDKVMVA STDEFAANAG FQVEDVFVSI NGTAISTDEQ VRLLVETSGG EPLDVIVQRA GAEQSLKVTP QYSEEAQRYR FGVGLGNPRE SVNIFQAIIN GFTYSFRLLG EMFMGFAMLI GGLLGTNAAP EGGLAGPVGI ARLTGQVARS GLRDYLNFTA LLSLNLALIN ILPIPALDGS RIIFALIEAI RRKKIPPERE AVVHAVGMMM LLGLMLLITV SDVRNIISGE PAITMPPTPT PIVRP
|
| |