Gene Haur_2836 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_2836 
Symbol 
ID5734717 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp3604201 
End bp3605298 
Gene Length1098 bp 
Protein Length365 aa 
Translation table11 
GC content51% 
IMG OID641279979 
Productputative membrane-associated zinc metalloprotease 
Protein accessionYP_001545602 
Protein GI159899355 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0750] Predicted membrane-associated Zn-dependent proteases 1 
TIGRFAM ID[TIGR00054] RIP metalloprotease RseP 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000950908 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATTTTA GCAGTTTGGC GTGGCTTGCG GTTATTCCCG CTTTAGGCTT TTTAGTTGTT 
GTTCATGAGT TGGGCCACTA TTGGGTTGGC CGCAAAATGG GCATTAAGAT CGAAGAATTT
GGCATTGGGT TACCGCCACG CGCCAAAGTG TTGTTCGTGC GCAAAGGTAT TCCGTTCACG
CTGAATTGGC TGCCCCTCGG CGGCTTTGTG CGGTTTGCGG GGGAAGAAGG TGGCTTCGAT
GACCCCGATA GCCTGGCTTC GGCCTCGCCA CGTCGCCGCA TTCCAGTGAT GGCAGCGGGT
GTAATTGCCA ACGTGATTAC CGCAATCATT ATGTTTGCGA TCATTTTTGC AATTTGGGGC
TACCCCAACC TTGATAAAGT GATGGTCGCT TCAACCGATG AATTTGCCGC AAACGCTGGG
TTTCAAGTCG AAGATGTGTT TGTTTCAATT AATGGAACAG CGATTAGTAC CGATGAACAG
GTGCGGCTGT TGGTTGAAAC CAGTGGTGGC GAGCCATTAG ATGTCATCGT CCAACGGGCT
GGCGCTGAAC AAAGCCTGAA GGTCACGCCC CAATATAGCG AAGAAGCGCA ACGCTATCGC
TTTGGGGTTG GCTTAGGCAA TCCACGCGAA TCAGTTAATA TTTTTCAAGC GATCATCAAC
GGGTTTACCT ATAGTTTTCG GCTATTAGGC GAAATGTTTA TGGGCTTTGC GATGTTGATC
GGCGGTTTGC TGGGCACGAA TGCAGCGCCG GAAGGTGGCT TAGCTGGCCC AGTTGGCATC
GCTCGCTTGA CGGGCCAAGT TGCTCGCTCG GGCTTGCGCG ATTATCTTAA TTTTACCGCC
TTGCTCAGCC TGAATTTGGC CTTGATCAAC ATTTTGCCAA TTCCAGCACT TGATGGTAGC
CGGATTATTT TTGCCTTGAT CGAAGCGATT CGCCGCAAGA AGATTCCACC TGAACGCGAA
GCAGTTGTGC ATGCGGTTGG GATGATGATG TTGCTTGGGT TGATGCTGCT AATTACCGTT
TCCGACGTGC GCAATATCAT TAGTGGCGAG CCAGCGATTA CCATGCCGCC AACCCCAACG
CCAATCGTTA GACCATAA
 
Protein sequence
MDFSSLAWLA VIPALGFLVV VHELGHYWVG RKMGIKIEEF GIGLPPRAKV LFVRKGIPFT 
LNWLPLGGFV RFAGEEGGFD DPDSLASASP RRRIPVMAAG VIANVITAII MFAIIFAIWG
YPNLDKVMVA STDEFAANAG FQVEDVFVSI NGTAISTDEQ VRLLVETSGG EPLDVIVQRA
GAEQSLKVTP QYSEEAQRYR FGVGLGNPRE SVNIFQAIIN GFTYSFRLLG EMFMGFAMLI
GGLLGTNAAP EGGLAGPVGI ARLTGQVARS GLRDYLNFTA LLSLNLALIN ILPIPALDGS
RIIFALIEAI RRKKIPPERE AVVHAVGMMM LLGLMLLITV SDVRNIISGE PAITMPPTPT
PIVRP