Gene Hoch_5067 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_5067 
Symbol 
ID8547478 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp6987741 
End bp6988844 
Gene Length1104 bp 
Protein Length367 aa 
Translation table11 
GC content66% 
IMG OID646389743 
Productmembrane-associated zinc metalloprotease 
Protein accessionYP_003269448 
Protein GI262198239 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0750] Predicted membrane-associated Zn-dependent proteases 1 
TIGRFAM ID[TIGR00054] RIP metalloprotease RseP 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.336988 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGGTGC TAGGCGCCAT TCTCGCCCTC AGCTTGATCA TTGTCGTCCA CGAGGCCGGC 
CACTACCTGG TCGCCAAGTG GTGCAAAATG CGCGTGGATC GCTTCAGCAT CGGGTTTGGC
CCGGCGATAG CGAGTTGGAA CCGCGGCGAA ACCAAGTTTC AGCTCGCGCC GATTCCGTTC
GGCGGGTTCG TCGAGATCCG CGGCATGAAC ATCGCCGAGG ACGTCCCACC AGACGATCCC
TACGCGTATC CCAACCGTCC CACCTGGCAG CGCTTCCTGA CCATCTTTGC CGGTCCCGGG
ACCAACTACC TGTTCGCGAC CGTGCTCGCC TTCGTGCTGT TCGCGGTCGC CGGCGTGCCC
AGCGGCACCT CGCACTACGT GGTCAACGGC GTGGCCAGCG AGGGCTTCGA CGCCATCGGC
AAGCTCGAAC CCGGCGACCA GATCATGGCG GTGCAGCGCG CGAGCGACAG CGAGCCGCAG
CCGGTGTACG TGCTCCTGGA CGGCAAGCCG GCCGAGAAGT CGCTGAGCCA GCTCGTGCAC
GAGAGCCAGG GCGCGCCGAT GCAGGTCGAC GTGCTGCGCG ACGGTCAGGC CATGAGCTTC
TCGATCACGG CCCGCCCCGA CCAGGGGCAG ATCAACAAGG AGACCGGCGA GCCTCAGTAC
CGCCTGGGTA TCAGCCTCGA GACCACGCGC GAGCGCGTCG GCGTCGGCCT CGTCGCCGCC
GTGGGCTACG CGGTCGAGTT CCCCATCGAG CACACCAAGC TCGCGCTCGC CAACCTCTAT
CAGATGATCA TGGGCGAGGT CGAAGCCGAG CTGACCGGGC CCGTGGGCAT CGCCGACGTC
ATCCAGCAGT CGATCCGGGT GGGCTGGATC GACGCGATGG CGATGCTGAT CCTGCTCAAC
GTGCTCGTCG GATTGTTCAA CCTGCTGCCC ATTCCGGCGC TCGACGGCGG CCGTCTGGTG
TTCTTGATCT ACGAGATGGC GACGCGTCGC CGGCCCAATC CGCGCTTCGA GGCCACGGTG
CACATGGTCG GCATCATGGT GCTGCTGGTG GTGCTGGTGG CGGTCACGGT CAAGGACATC
GCGCGCATCA TCGAGCGCCT GTAG
 
Protein sequence
MSVLGAILAL SLIIVVHEAG HYLVAKWCKM RVDRFSIGFG PAIASWNRGE TKFQLAPIPF 
GGFVEIRGMN IAEDVPPDDP YAYPNRPTWQ RFLTIFAGPG TNYLFATVLA FVLFAVAGVP
SGTSHYVVNG VASEGFDAIG KLEPGDQIMA VQRASDSEPQ PVYVLLDGKP AEKSLSQLVH
ESQGAPMQVD VLRDGQAMSF SITARPDQGQ INKETGEPQY RLGISLETTR ERVGVGLVAA
VGYAVEFPIE HTKLALANLY QMIMGEVEAE LTGPVGIADV IQQSIRVGWI DAMAMLILLN
VLVGLFNLLP IPALDGGRLV FLIYEMATRR RPNPRFEATV HMVGIMVLLV VLVAVTVKDI
ARIIERL